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HEPARIN BINDING MITOGEN WITH HOMOLOGY TO EPIDERMAL GROWTH FACTOR (EGF) 

Background Q f yne Invention 
This application is a continuation-in-part of 
5 Klagsbrun et al., U.S. S.N. 07/598,082, filed October 16, 
1990. 

This invention was made with Government support 
under #R37CA37392 awarded by the National Institute of 
Health. The government has certain rights in the 

io invention. 

This invention relates to growth factors. 
Growth factors play a central role in mediating 
cell proliferation and differentiation, for example, 
basic fibroblast growth factor (bFGF) , platelet-derived 

15 growth factor (PDGF) , transforming growth factor-alpha 

(TGF-ot) , and transforming growth factor-beta (TGF-/3) have 
been implicated in the proliferation of connective tissue 
cells and the induction of angiogenesis characteristic of 
wound repair (Van Brunt and Klausner, 6 Biotechnology 25, 

20 1988). In addition, Agel et al. (146 J. Pathol. 197, 
1985) suggest that growth factors are involved in the 
etiology of atherosclerosis; for example, the smooth 
muscle cell (SMC) hyperplasia that accompanies 
atherosclerosis has been attributed to PDGF, a potent SMC 

25 mitogen. 

Heparin affinity chromatography has been used 
extensively for purifying and characterizing a variety of 
these growth factors. Acidic FGF (aFGF) and basic FGF 
(bFGF) bind to immobilized heparin columns and are eluted 

30 with 1.0M to 1.2M NaCl and 1.5M to 1.8M NaCl, 

respectively (Folkman and Klagsbrun, 235 Science 442, 
1987; Lobb et al., 261 J. Biol. Chem. 1924, 1986). 
Several growth factors which are structurally homologous 
to aFGF and bFGF also have an affinity for heparin (see, 
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for example, Rubin et al., 86 Proc. Natl. Acad. Sci. USA 
802, 1989). PDGF binds to immobilized heparin, but with 
relatively low affinity, being eluted with only 0.5M 
NaCl. Epidermal growth factor (EGF) does not bind 

5 heparin to any substantial extent under the conditions 
described in the cited references on heparin binding 
growth factors. Lobb et al. (261 J. Biol. Chem. 1924, 
1986) report the partial purification by heparin affinity 
of two classes of growth factors mitogenic for 

0 endothelial cells. Gospodarowicz et al. (81 Proc. Natl. 
Acad. sci. USA 6963, 1984) report the use of heparin 
affinity in the purification of bovine brain and 
pituitary fibroblast growth factors. Shing et al. (29 J. 
Cell Biochem. 275, 1985) report a chondrosarcoma-derived 

5 growth factor purified by heparin-Sepharose affinity 
chromatography and Bio Rex 70 cation exchange 
chromatography. Bohlen et al. (185 FEBS Lett. 177, 1985) 
report a fibroblast growth factor, derived from human 
brain, which is purified by cation-exchange 

o chromatography, heparin-Sepharose affinity, and reverse- 
phase HPLC. Shing et al. (223 Science 1296, 1984) report 
a heparin-binding tumor cell-derived capillary 
endothelial cell factor.. Besner et al. (107 J. Cell 
Biol. 481a, 1988) report the detection of a heparin- 

5 binding, mononuclear cell-derived growth factor (s) which 
is cationic, is of 6000-14,000 MW, is inactivated by heat 
(100°C, 10 min) , is inactivated by dithiothreitol (5mM) , 
and is resistant to incubation with 4M guanidine or 0.1M 
HC1. 

0 Sianmary of the Invention 

In one aspect, the invention generally features a 
novel growth factor which we have termed heparin binding 
EGF-homologous mitogen (HB-EHM) . By "EGF homologous" is 
meant having a segment structurally related to epidermal 

5 growth factor in that it contains six cysteine residues 
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spaced in a manner characteristic of any of the EGF 
family of proteins (e.g. , human amphiregulin (AR) , Shoyab 
et al. f 243 Science 1074, 1989; human TGF-a , Derynck et 
al., 38 Cell 287, 1984; and human EGF, Gregory, 257 
5 Nature 325, 1975), which participates in binding as 
described below to one or more of the EGF family of 
receptors (e.g., on A-431 cells or smooth muscle cells). 
In general, an EGF-homologous segment renders the protein 
including such a domain at least partially resistant to 

10 heat (e.g., following exposure to a temperature of 90° 
for 5 minutes) and sensitive to dithiothreitol (DTT) 
(e.g., following exposure to 5mM DTT for 2 hours). 
"Heparin binding" means having a specific affinity for 
heparin (i.e., an affinity beyond that predicted only by 

15 ionic interactions) as evidenced by binding to heparin at 
NaCl concentrations above those which elute similarly 
charged proteins having no specific affinity. In 
general, heparin binding factors remain bound to heparin 
up to NaCl concentrations of at least 0.6M (most 

20 preferably, at least 0.9M). 

In a second aspect, the invention features 
polypeptides which bind heparin, which include an EGF- 
homologous segment, and which stimulate growth of 
fibroblast cells, epithelial cells, and smooth muscle 

25 cells, but not endothelial cells. 

In preferred embodiments, these polypeptides are 
human HB-EHM, and, more preferably, they include a 
characteristic EGF-homologous segment sequence 
(substantially, amino acids 108 to 143 of SEQ ID N0:1: C 

30 LRKYKDFCIHGECKYVKELRAPSCICHPG 
Y H G E R C) . One particular such peptide includes the 
above described EGF-homologous segment and all or part of 
amino acids 1 to 208 of SEQ ID NO:l: 

MKLLPSVVLKLFLAAVLSALVTGESLERL 
35 RRGLAAGTSNPDPPTVSTDQLLPLGGGRD 
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RKVRDLQEADLDLLRVTLSSKPQALATPN 
KEEHGKRKKKGKGLGKKRDPCLRKYKDFC 
IHGECKYVKELRAPSCICHPGYHGERCHG 
LSLPVENRLYTYDHTTILAVVAVVLSSVC 
5 LLVIVGLLMFRYHRRGGYDVENEEKVKLG 
M T N S H. 

In a mature form, HB-EHMs may have an amino-terminus 
between aspartic acid residue 63 and alanine residue 82 
(as shown in Fig. 3, SEQ ID NO:l) and a carboxy-terminus 

io between serine residue 147 and proline residue 149 (also 
as shown in Pig. 3, SEQ ID NO:l). The invention also 
encompasses smaller polypeptides, for example, those 
having an amino-terminus between arginine residue 73 and 
alanine residue 82 (as shown in Fig. 3, SEQ ID NO: 1) and 

15 a carboxy-terminus at serine residue 147 (also as shown 
in Fig. 3, SEQ ID NO: 1). These polypeptides are 
preferably acid stable. The isolated polypeptide may 
contain a sequence substantially identical to the amino 
acid sequence shown in Fig. 1 (amino acids 82 to 147 in 

20 SEQ ID N0:l) or may contain a sequence substantially 
identical to the amino acid sequence shown in Fig. 4 
(amino acids 63 to 148 in SEQ ID NO: l) . The 
polypeptides according to the invention are preferably 
cationic, may have a pi of between 7.2 and 7.8 (e.g., 

25 when produced by a eukaryotic cell) , and may be, but need 
not be, glycosylated. Preferred polypeptides according 
to the invention have an apparent molecular weight of 
approximately 22,000 on a non-reducing polyacrylamide gel 
and include at least 66 amino acid residues. In 

30 addition, these polypeptides, preferably, are 

sufficiently isolated from other co-purifying substances 
to be suitable for therapeutic use. 

In other aspects, the invention features: purified 
nucleic acid which encodes the polypeptides described 

35 above; vectors (preferably, pMTN-HBEGF, pAX-HBEGF, pNA28, 
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and pNA5l) which direct expression of this nucleic acid 
in eukaryotic (preferably, mammalian) or prokaryotic 
(preferably, Escherichia coli, most preferably, E. coli B 
or E. coli W3110) cells. The invention also features 
5 cells containing such vectors; such cells may be 

eukaryotic cells (for example, mammalian cells capable of 
secreting a mature form of the protein into the growth 
medium) or prokaryotic cells which are capable of 
producing a polypeptide which includes the primary 

10 sequence of a mature form of the protein and which may 
also include additional amino acids at the amino or 
carboxy terminus which facilitate improved expression, 
stability, or ease of isolation of the HB-EHM. The 
expression vectors or vector-containing cells of the 

15 invention can be used in a method of the invention to 
produce HB-EHM and equivalent polypeptides. These 
polypeptides can be used in a method of the invention for 
healing a wound in a patient, involving applying to the 
wound a wound healing amount of the polypeptides 

20 described above. The polypeptides can also be used in a 
method of the invention for the in vitro culturing of a 
cell whose proliferation is stimulated by HB-EHM, 
preferably, fibroblasts , epithelial cells, or smooth 
muscle cells, involving contacting the cells with a 

25 growth-stimulatory amount of the polypeptides described 
above. The polypeptides can further be used to produce 
an antibody which preferentially binds to the 
polypeptides. The antibody is preferably monoclonal and 
neutralizes the in vivo biological activity of the 

30 polypeptides described above. 

In a final aspect, the invention features a method 
for identifying an antagonist to HB-EHM, involving 
providing HB-EHM to cultured cells whose growth is 
stimulated by this factor, in the presence of a candidate 

35 antagonist, and determining whether the candidate 



WO 92/06705 



PCT/US91/07691 



- 6 - 

antagonist is capable of blocking HB-EHM- induced growth 
of the cells. 

By "mature" is meant the protein in one of its 
processed extracellular forms. By "biological activity" 

5 is meant the ability of the factor to stimulate the 

growth of cells (e.g., fibroblasts, epithelial cells, or 
smooth muscle cells, but not endothelial cells) assayed, 
e.g., using the methods described below. By "isolated" 
is meant removed from its naturally-occurring environment 

10 and preferably produced as a homogeneous solution by 
standard biochemical or recombinant DNA techniques. By 
"acid stable" is meant retaining biological activity (as 
described above) following, e.g., exposure to a solution 
at pH 2.5 for 2 hours. By "glycosylated" is meant having 

15 one or more covalently-linked carbohydrate moieties. By 
"un-glycosylated" is meant lacking covalently-linked 
carbohydrate moieties. By "apparent molecular weight" is 
meant the molecular weight, determined on a denaturing 
polyacrylamide gel, by comparison with standards, e.g., 

20 protein standards, of known molecular weight. By "non- 
reducing polyacrylamide gel" is meant a polyacrylamide 
electrophoretic gel lacking a reducing agent such as jS- 
mercaptoethanol. By "healing a wound" is meant, without 
limitation, stimulating tissue repair or blood vessel 

25 development. By an "antagonist" is meant a molecule 

which inhibits an activity of a factor, in this case, HB- 
EHM, such as the ability to stimulate the growth of 
responsive cells (e.g., fibroblasts, epithelial cells, 
and smooth muscle cells) . By a "substantially identical" 

30 amino acid sequence is meant an amino acid sequence which 
differs only by conservative amino acid substitutions, 
for example, substitution of one amino acid for another 
of the same class (e.g., valine for glycine, arginine for 
lysine, etc.) or by one or more non-conservative amino 

35 acid substitutions, deletions, or insertions located at 
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positions of the amino acid sequence which do not destroy 
the biological activity of the growth factor (as 
described above) . Such equivalent growth factors can be 
isolated by extraction from the tissues or cells of any 
5 animal which naturally produce such a factor or which can 
be induced to do so, using the methods described below, 
or their equivalent; or can be isolated by chemical 
synthesis; or can be isolated by standard techniques of 
recombinant DNA technology, e.g., by isolation of cDNA or 
10 genomic DNA encoding such a growth factor. By a 

"substantially identical" nucleic acid sequence is meant 
a nucleic acid sequence which encodes a substantially 
identical amino acid sequence (i.e., one which is 
identical or which differs only by conservative amino 
15 acid substitutions, non-conservative amino acid 

substitutions, or deletions or insertions of amino acid 
sequence which do not destroy the biological activity of 
the factor, as described above). Such nucleic acid 
sequences may be isolated, without limitation, by 
20 standard techniques of recombinant DNA technology (e.g., 
by isolation of cDNA or genomic DNA, or by in vitro 
mutagenesis, by polymerase chain reaction methodology or 
by chemical synthesis) . By "neutralize" is meant to 
partially or completely block (e.g., the biological 
25 activity of the growth factor) . 

This invention includes the growth factors, as 
translated (e.g., the 66 amino acid form of HB-EHM which 
include the amino acids shown in Fig. 1, or the 86 amino 
acid form of the HB-EHM shown in Fig. 4, or the 208 amino 
30 acid form of the protein shown in Fig. 3) as well as any 
growth-promoting forms of the protein which have 
undergone post-translational modification or processing. 
Such post-translational modification may include, without 
limitation, a processed amino-terminus, for example, 
35 removal of all or part of a signal sequence or all or 
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part of a pro sequence; a processed car boxy-terminus, for - 
example, removal of all or part of a membrane-spanning 
domain or all or part of a cytoplasmic domain; o- linked 
glycosylation; or any combination, thereof. Moreover, 
5 this invention is not limited to the amino acid or 
nucleic acid sequences provided in Figs, l, 3, or 4; 
those of ordinary skill in the art can readily isolate 
equivalent or substantially identical growth factors or 
nucleic acid sequences encoding equivalent or 

10 substantially identical growth factors using the methods 
described herein. 

The growth factors of this invention play a role 
in the proliferative responses characteristic of wound 
repair, growth and development, atherosclerosis, 

15 neoplasia and myelofibrosis, and are therefore useful for 
stimulating the growth of cultured vertebrate cells, for 
enhancing the healing of wounds, and for detecting and 
treating atherosclerosis and neoplastic diseases. 

Other features and advantages of the invention 

20 will be apparent from the following description of the 
preferred embodiments thereof, and from the claims. 



Description of the Preferred Embodiments 
The drawings will first briefly be described. 
Drawings 

25 FIG. 1 is a representation of an amino acid 

sequence which is present in one form of mature (i.e., 
processed extracellular) heparin binding EGF-homologous 
mitogen. 

FIG. 2 is a representation of the amino acid 
30 sequences of some EGF-homologous segments. 

FIG. 3 is a representation of the nucleic acid 
sequence of a HB-EHM-encoding cDNA and the deduced amino 
acid sequence of the primary translation product. 
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FIG. 4 is a representation of an amino acid 
sequence which is present in at least one form of mature 
(i.e., processed extracellular) heparin binding EGF- 
homologous mitogen. 
5 FIG. 5 is the nucleic acid and deduced amino acid 

sequence of the fusion protein chloramphenicol 
acetyltransferase (CAT) - HB-EHM. 

Derivation of Deposited Materials 

Materials have been deposited with the American 

io Type Culture Collection (ATCC) in Rockville, MD, which 
allow others skilled in the art to readily obtain the 
materials of this invention. The derivation of these 
deposited materials is described below. The growth 
factor produced by these deposited materials is exemplary 

15 of, not limiting to, this invention; those of ordinary 
skill in the art can readily isolate equivalent growth 
factors, nucleic acid encoding such growth factors, and 
antibodies which preferentially bind to such growth 
factors using the methods described below. 

2° Isolation of Lymphoma Factor and Amino Acid Sequence 
One source of heparin binding EGF-homologous 
mitogen (HB-EHM) is the human histiocytic lymphoma cell 
line U-937, available from the American Type Culture 
Collection, 12301 Parklawn Drive, Rockville, MD 20852 

25 (Accession No. CRL 1593) . HB-EHM was isolated from this 
cell line as follows. U-937 cells were plated at 1-2 x 
10 8 cells/T-150 flask (Costar, Cambridge, MA) in RPMI 
1640 (GIBCO, Grand Island, NY) supplemented with 10% 
fetal calf serum (GIBCO, Grand Island, NY) and 

30 antibiotics (loo units/ml penicillin and 100 ug/ml 

streptomycin sulfate, GIBCO, Grand Island, NY) . Cells 
may be treated with 60nM phorbol myristate acetate (PMA; 
also known as 12-O-tetradecanoyl phorbol-13 -acetate; or 
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TPA) for 24 hours at 37° to facilitate attachment of the 
cells to the tissue culture flask; however, this step is 
not necessary for HB-EHM production. Cells were washed 
twice with phosphate-buffered saline, and the medium was 
5 replaced with serum-free RPMI 1640 supplemented with 
antibiotics. After 72 hours at 37°, culture fluids were 
collected and replaced with the same amount of fresh 
serum-free medium, and the cells were again incubated for 
72 hours until the next medium change. Cells were 

10 maintained in this way for 10-14 days, with continuous 
secretion of growth factors. Conditioned medium (CM) was 
collected every 3 days. The collected medium was 
centrifuged for 10 minutes at 10,000 rpm in a GS-3 
Sorvall rotor, and benzamidine hydrochloride (Sigma, St. 

15 Louis, HO) was added to the supernatant to a final 
concentration of 1 mM to protect against protease 
degradation. The supernatant was stored at -20°c until 
use. 

Alternatively, conditioned medium was prepared by 

20 washing the PMA-treated cells with phosphate-buffered 
* saline and replacing the medium with serum-free RPMI 1640 
medium (as described above) . The conditioned medium was 
then collected and replaced with fresh serum-free medium 
approximately 24, 48, 72, 120, and 168 hours later (i.e., 

25 after PMA treatment) . 

Conditioned medium was assayed for growth factor 
activity directly, as described. below, using either 
fibroblasts (i.e., BALB/c mouse 3T3 cells), epithelial 
cells (i.e., human keratinocytes) , or smooth muscle cells 

30 (i.e., bovine aortic smooth muscle cells, BASMC) . 

Alternatively, CM was first fractionated by fast protein 
liquid chromatography (FPLC, Pharmacia, Piscataway, NJ) 
by applying 500 ml of the CM to a TSK-heparin 5PW column 
(8 x 75 mm, TOSOHAAS, Philadelphia, PA) . The column was 

35 washed with 10 column volumes of equilibration buffer 
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(0.2M NaCl, 0.01M Tris-HCl, pH 7.5) , and bound protein 
was eluted with a 40 ml linear gradient of 0.2 - 2M NaCl 
in 0.01M Tris-HCl r pH 7.4 at 1 ml/min. Fractions (2.5 
ml) were tested for growth factor activity by measuring 
5 the effect of aliquots of the fractions on DNA synthesis 
in, or the proliferation of, BALB/c mouse 3T3 cells; such 
activity was monitored by assaying the incorporation of 
[ 3 H] -thymidine into DNA and/ or by measuring the increase 
in cell number. Measurement of DNA synthesis in BALB/c 

10 mouse 3T3 cells was performed as described by Shing et 
al. (223 Science 1296, 1984, hereby incorporated by 
reference) . One unit of BALB/c mouse 3T3 stimulation 
activity was defined as the amount of growth factor 
required to stimulate half-maximal DNA synthesis in 

15 BALB/c mouse 3T3 cells under the conditions described in 
Shing et al. ( supra ) . To measure DNA synthesis in bovine 
capillary endothelial cells, cells were trypsinized and 
re-plated sparsely (1 x 10* cells/well) with 400 nl of 
Dulbecco's modified Eagle's medium (DMEM, 6IBC0, Grand 

20 Island, NY) supplemented with 10% bovine calf serum 

(GIBCO, Grand Island, NY) and antibiotics (100 units/ml 
penicillin and 100 jtig/ml streptomycin sulfate, GIBCO, 
Grand Island, NY) in 48-well plates (Costar, Cambridge, 
MA) . After 24 hours of incubation, the medium was 

25 replaced with the same volume of DMEM supplemented with 
2% bovine calf serum, antibiotics, l pVl thymidine and 
0.5% bovine serum albumin. After 24 hours of incubation, 
the test samples were added to the wells. Eighteen hours 
later, [ 3 H] -thymidine (0.6 uCi/well, New England Nuclear, 

30 Wilmington, DE) was added. After an additional 5 hours, 
each plate was treated by the same procedure as used to 
harvest the BALB/c mouse 3T3 cells to measure the 
incorporation of [ 3 H] -thymidine into the DNA of the 
cells. To measure DNA synthesis in bovine aortic smooth 

35 muscle cells (from Dr. H. Weich, Children's Hospital, 
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Boston) , the cells were trypsinized and re-plated 
sparsely (1 x 10* cells/well) with 400 Ml of DMEM 
supplemented with 10% Colorado calf serum (Colorado Serum 
Company) and antibiotics (100 units /ml penicillin and 100 
5 /ig/ml streptomycin sulfate, GIBCO, Grand Island, NY) in 
48-well plates. After the cells were grown to 
confluence, the medium was replaced with the same volume 
of DMEM supplemented with 2% Colorado calf serum and 
antibiotics (as described above) . After 24 hours of 

10 incubation, the test samples were added to the wells. 
Eighteen hours later, [ 3 H] -thymidine (l uCi/well) was 
added. After an additional 6 hours, the plates were 
treated by the same procedure used to harvest the BALB/c 
mouse 3T3 and capillary endothelial cells to measure the 

15 incorporation of [ 3 H] -thymidine into the DNA of the 
cells. 

Bovine aortic smooth muscle cell proliferation was 
also assayed by counting cell number following growth 
factor stimulation. In these experiments, cells were 

20 plated sparsely in 24 well plates in DMEM/ 10% bovine calf 
serum/1% GPS (100% GPS is glutamine 29.2 mg/ml, 
penicillin 10,000 units/ml, streptomycin sulfate 10,000 
Mg/ml; GIBCO, Grand Island, NY) at 10 4 cells/well. Test 
samples were then added and, after three days, cells were 

25 removed by trypsinization and counted in a Coulter 
counter (Coulter, Hialeah, FL) . 

HB-EHM was purified as follows. Growth factor 
activity was monitored during the purification procedures 
using BALB/c mouse 3T3 cells as target cells and 

30 measuring [ 3 H] -thymidine incorporation into DNA. 

Conditioned medium (8-10L) was applied directly to a Bio- 
Rex 70 cation exchange column (5 x 10 cm, 200 ml) 
(BioRad, Hercules, CA) equilibrated with 0.2M NaCl, 0.01M 
Tris-HCl, pH 7.5, at a flow rate of 300 ml/hour. The 

35 column was washed extensively with the equilibration 
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buffer, and the bound protein was eluted with 1M NaCl, 
0.01M Tris-HCl, pH 7.5. The biologically active 
fractions (as assayed above using BALB/c mouse 3T3 cells) 
were adjusted to pH 8.0 and applied to a copper-chelating 
5 Sepharose column (2 x 11 cm, Pharmacia LKB Biotechnology 
Inc. , Piscataway, NJ) saturated with cupric sulfate and 
equilibrated with 0.5M NaCl, 0.01M Tris-HCl, pH 8.0. 
After washing extensively with the equilibration buffer, 
bound proteins were eluted with a 200 ml linear gradient 

io of 0 - 0.04M L-histidine in 0.5M NaCl, 0.01M Tris-HCl, pH 
8.0, at a flow rate of 40 ml /hour. Using this method, a 
single bioactive peak was eluted by 0.02M to 0.025M L- 
histidine. The biologically active fractions included in 
this peak of activity were pooled, diluted 1:1 with 0.01M 

15 Tris-HCl, pH 7.5, and applied to a TSK-heparin 5PW FPLC 
column (8 x 75 mm, TOSOHAAS, Philadelphia, PA) 
equilibrated with 0.2M NaCl, 0.01M Tris-HCl, pH 7.5. The 
column was washed with 20 ml of 0.2M NaCl, 0.01M Tris- 
HCl, pH 7.5, and bound protein was eluted with a 40 ml 

20 linear gradient of 0.2 - 2M NaCl in 0.01M Tris-HCl, pH 
7.5, at a flow rate of 1 ml/min. 

A partially purified HB-EHM sample prepared by the 
three step purification outlined above (and eluting from 
the TSK-heparin column at 1-1.2M NaCl) was applied to a 

25 C 4 -reversed phase high performance liquid chromatography 
column (RP-HPLC) . A Beckman model 334 HPLC system was 
used (Beckman instruments, Inc., Somerset, NJ) . The 
sample was loaded onto the C 4 -reversed phase HPLC column 
(4.6 x 250mm, Vydac) after equilibration of the column 

30 with 5% acetonitrile, 0.1% trifluoroacetic acid. The 
column was washed with this solvent extensively, and 
bound proteins were eluted with a 5 ml gradient of 5% - 
15% acetonitrile, 0.1% trifluoroacetic acid followed by a 
120 ml gradient of 15% - 40% acetonitrile, 0.1% 

35 trifluoroacetic acid at a flow rate of 1 ml/min. 
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Fractions were collected, and aliquots diluted 1 to 10 in 
1% bovine serum albumin (Sigma, St. Louis, MO) in 
phosphate-buffered saline and analyzed for growth factor 
activity. Eluting protein was detected by monitoring 
5 absorbance at 214 nm. 

A single predominant peak of growth factor 
activity was eluted from the C 4 column with approximately 
23% acetonitrile. This peak of activity corresponded to 
two peaks of growth factor absorbance at 214 nM (i.e., a 

10 peak which eluted at 33 minutes and a peak which eluted 
at 34.5 minutes). Fractions were collected that 
corresponded to the first of these two peaks (i.e., the 
33 minute peak) and were further analyzed. The purified 
protein migrated as a single band with an apparent 

15 molecular weight of about 22 kD under non-reducing 

conditions (i.e., on a 15% polyacrylamide/SDS gel) and an 
apparent molecular weight of 20 kD under reducing 
conditions (i.e., on a 15% polyacrylamide/SDS gel 
containing 1% p-mercaptoethanol) . The apparent molecular 

20 weight of the growth factor was determined by comparing 
the electrophoretic migration of the factor with the 
electrophoretic migration of proteins of known molecular 
weight. 

As measured by the factor's growth-stimulatory 
25 activity (assayed as [ 3 H] -thymidine incorporation into 
Balb/c mouse 3T3 cellular DNA by methods described 
above) , this purified form of HB-EHM was resistant to 
exposure to pH 2.5 for 2 hours and not destroyed by 
heating to 90 °C for 5 minutes but was destroyed 
30 completely by exposure to 5mM DTT for 2 hours. The 

activity was also not lost after acid treatment with 0.1M 
glycine-HCl, pH 2.5 for 2 hours. 

In all of these procedures, tubes siliconized by 
Sigmacote (Sigma) were used to avoid loss of activity 
35 resulting from protein adsorption to the glass tubes. 
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The purification of one representive 8L 
preparation of conditioned medium is summarized in the 
following table. 
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PURIFICATION OF HB-EHM 
FROM U-937 CONDITIONED MEDIUM 

Purification Protein Maximal* Purification Yield c 

ste P (ug) Stimulation (fold) f%) 

(ng/ml) v ' 

Bio-Rex 70 450,000* 3 f 750 1 100 % 

Cu-chelating 90,000 b 3,700 l 2 3% 
Sepharose 

TSK-Heparin 295 d 16.4 229 18% 

C 4 RP-HPLC 1.2 8 0.5 7,500 15% 

*• The maximal stimulation was determined from J [H] -thymidine 

b incorporation into smooth muscle cell DNA. 

. Protein was estimated by using A 1 * =14 ug. 

The yield was based on the total activity in the first 
d partially purified fraction. 

e - Protein was estimated by using A 1 * 14 «= 140 ug. 

Protein was estimated by amino acid analysis. 

This purified form of the protein stimulated 
proliferation of BALB/c mouse 3T3 fibroblast cells at a 
concentration between 50 pg/ml and 1,000 pg/ml; 
stimulated proliferation of bovine aortic smooth muscle 
cells at a concentration between 50 pg/ml and 500 pg/ml, 
and stimulated proliferation of human keratinocytes at a 
concentration between 100 pg/ml and 2 ng/ml. The factor, 
however, did not stimulate proliferation of bovine 
capillary endothelial cells. 

To determine the amino acid sequence of this 
purified form of the protein, approximately 1.7 ug of 
protein obtained after cation exchange, copper-affinity, 
and heparin-aff inity chromatography and two cycles of c 4 - 
reversed phase HPLC of 20 L of conditioned medium were 
loaded onto an Applied Biosystems gas-phase protein 
sequencer. Twenty rounds of Edman degradation were 
carried out, and identifications of amino acid 
derivatives were made with an automated on-line PTH-amino 
acid analyzer (model 477A, Applied Biosystems, Foster 
City, CA) . The yield of the amino-terminal residue was 
177 pmoles. Amino acid assignments made for cycles 1-20 
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were as follows (SEQ ID NO: 3): Val-X-Leu-Ser-Ser-Lys- 
Pro-Gln-Ala-Leu-Ala-X-Pro-Asn-Lys-Glu-Glu-His-Gly-Lys, 
where X is unknown or questionable. 

In subsequent purification of HB-EHM from U-937 
5 conditioned medium (as described above) , a different C 4 
column was used for the RP-HPLC purification steps, and 
BALB/c 3T3 3 [H] -thymidine incorporation assays were 
carried out using ten-fold higher amounts of protein from 
each fraction. Using such methods, four major peaks of 

10 growth factor activity were eluted from the column at 33, 
34.5, 43.3, and 47.8 minutes subsequent to the start of a 
15% - 40% acetonitrile gradient. On a non-reducing 15% 
polyacrylamide/SDS gel, the 33, 43.3, and 47.8 minute 
peaks of activity corresponded to proteins of apparent 

15 molecular weight: 22, 23, and 22.5 kD, respectively. The 
34.5 minute peak sometimes displayed two protein bands, 
one of apparent molecular weight 22 kD and the other of 
apparent molecular weight 19 kD; in other preparations, 
only the band of apparent molecular weight 19 kD band was 

20 detected. A fifth peak of growth factor activity was 
also detected in some preparations. This peak eluted 
between the 34.5 and the 43.3 minute peaks and contained 
a protein species of apparent molecular weight 24 kD 
under non-reducing polyacrylamide/SDS gel electrophoresis 

25 conditions. 

The most predominant form of HB-EHM in these 
purifications was generally the form eluting at 33 
minutes. 

The specific activities of the additional growth 
30 factor species were analyzed; all were found to be nearly 
equivalent in their ability to stimulate DNA synthesis in 
BALB/c 3T3 cells and to compete with 125 I-EGF (see below) 
for binding to the EGF receptor. These results suggested 
that the growth factor species contained in the different 
35 isolated peaks all corresponded to forms of HB-EHM. 
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Amino-terminal sequencing was then carried out on 
five additional samples of HB-EHM representing several of 
the various peaks of growth factor activity as described 
above. As indicated above, by SDS-polyacrylamide gel 
5 electrophoretic analysis, these samples were found to 
contain protein species of varying apparent molecular 
weights. Specifically, sample #1 (i.e., the peak eluting 
between the 34.5 and 43.3 minute peaks) contained a 
species with an apparent molecular weight of 24 kD; 

10 sample #2 (i.e., the 33 minute peak) contained a species 
with an apparent molecular weight of 22 kD; sample #3 
(i.e., the 34.5 minute peak) contained a mixture of a 
species of apparent molecular weight 22 kO and a species 
of apparent molecular weight 19 kD; sample #4 (i.e., an 

15 independent isolation of the 34.5 minute peak) consisted 
primarily of a species of apparent molecular weight 19 
kD; and sample #5 (i.e., the 43.3 minute peak) contained 
a species with an apparent molecular weight of 23 kD. An 
attempt to obtain the amino-terminal sequence of sample 1 

20 was unsuccessful, indicating that this protein is amino- 
terminally blocked. The amino-terminal amino acid 
sequences obtained for Samples 2, 3, 4, and 5 were: 

Sample #2 
(SEQ ID NO: 4) : 
25 RVXLSSKPQALAXPNKEEHGKRKKKGKGLGKKRDPXLRKYKDFXIHGEXXY 

Sample #3 

(SEQ ID NO: 5) : RVXLSSKPQALAXPNKEE (approx. 75% of 
sample) 

(SEQ ID NO: 6) : SSKPQALAXXNXEE (approx. 5% of sample) 
30 (SEQ ID NO: 7) : ALAXXNKXEXGKR (approx. 20% of sample) 



Sample #4 
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(SEQ ID NO: 8) : RVXLSSKPQAIAXPNKEEHGKRKK (approx. 65% of 
sample) 

(SEQ ID NO: 9): XXKPQALAXXNXE (approx. 5% of sample) 
(SEQ ID NO: 10) : ALAXPNKEEXGKR (approx. 30% of sample) 

5 Sample #5 

(SEQ ID NO: 16): DLQEADLDLLRVXLXS 

These amino-terminal sequencing results indicated 
that HB-EHM r as isolated by the purification schemes 
outlined in the examples above, can have several forms 
10 which differ in their amino termini. These include: a 
form with the originally identified amino-terminus (see 
SEQ ID NO: 3) (VXLSSKPQALA ...); a form with a terminus 
extended one amino acid amino-terminal to the original 
end (see SEQ ID NO: 4) (RVXLSSKPQ ...); a form with a 
15 terminus extended 11 amino acids amino-terminal to the 
original end (see SEQ ID NO: 16); a form lacking the first 
three residues of the original form (see SEQ ID NO: 6) 
(SSKPQALA . . . ) ; and a form missing the first eight 
residues of the original form (see SEQ ID NO: 7 and 10) 
(ALAXPNKE ...). In addition, the observation that the 
largest (24 kD) species is amino-terminally blocked 
suggests that a form of HB-EHM may exist that is extended 
amino-terminally from the sequence (see SEQ ID NO: 16) 
DLQEADLDLLRV. 

To obtain internal amino acid sequence 
information, aliquots of samples 2, 3, and 4 were 
combined and subjected to trypsin digestion in order to 
generate peptide fragments for sequencing. For this 
analysis, the combined samples were dried, resuspended in 
200 nl of 6M guanldine-HCl/0.5M Tris-HCl, pH 8.0/lmM 
EDTA/lOmM dithiothreitol, and incubated for 60 minutes at 
37°C. Iodoacetamide (0.925 mg) was added to a final 
concentration of 25mM, and the solution was incubated at 
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room temperature for 30 minutes. These two treatments 
were carried out to reduce and alkylate the cysteine 
residues in the protein. To modify lysine residues, 
succinic anhydride (loo mg/ml in acetonitrile) was added 
5 in four 5j*l aliquots, with a five minute incubation at 
room temperature between each addition. The protein 
mixture was desalted by passage through a C 4 -reversed 
phase HPLC column, dried, resuspended in 200 fil of lOOmM 
ammonium bicarbonate, and digested with 0.5 jug of trypsin 

10 at 25 °C for four hours. A second aliquot of trypsin (0.3 
Mg) was added, and the reaction was incubated for two 
additional hours at 27°C. Digestion products were 
separated on a C 18 -reversed phase HPLC (RP-HPLC) column 
and subjected to amino terminal sequencing. 

15 The sequencing results indicated that the succinic 

anhydride treatment gave only a partial blockage of the 
lysine residues in the combined HB-EHM sample. Many of 
the fractions collected from the c l8 RP-HPLC column 
contained a mixture of peptide fragments. The amino acid 

20 sequences determined and the peptide residues as 
designated in SEQ ID N0:1 were: 



Fraction 



Sequencer si 



Peptide 



Residues 



J 



YVKELR 123 to 128 

DFCIHGECK 114 to 122 

KYKDFCIHGECKYVK 111 to 125 

DFCIHGECKYVKELR 114 to 128 

KYKDFCIHGECKYVKELR 111 to 128 

KYKDFCIHGECK ill to 122 

KYKDFCIHGECKYVKELR 111 to 128 

CHGLS 143 to 147 

KYKDFCIHGECKYVK 111 to 125 



S 



Q 



X 



T 



Additional amino acid sequence information was 
obtained by generating and sequencing tryptic fragments 
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from the peak of HB-EHM activity eluting from the C 4 
column at 47.8 minutes. A total of about 4 pq of the 
47.8 minute peak material was dried and resuspended in 
150 Ml of 4M guanidine-HCl, 0.1M Tris-HCl, pH 8.0. 
5 Dithiothreitol (DTT) was added to a final concentration 
of 20 mM, and the reaction was incubated at 37 °c for 60 
minutes. The reduced cysteines were then alkylated by 
the addition of solid iodoacetamide to a final 
concentration of 25 mM followed by incubation of the 
10 reaction mixture for 30 minutes at room temperature. 

After desalting on a C 18 reversed phase HPLC column (4.6 X 
150 mm, Vydac; gradient of 10% to 40% acetonitrile in 
0 . 1% trif luoroacetic acid) , the protein was dried and 
resuspended in 150 Ml of 0.1 M ammonium bicarbonate, 
is Trypsin (0.4 jug, Boehringer Mannheim) was added, and the 
reaction was incubated at 27 °C for 2 hours. The 
digestion products were then fractionated on the C 18 HPLC 
(4.6 x 150 mm) column using a gradient of 3% to 63% 
acetonitrile in 0.1% trif luoroacetic acid. All of the 
20 peaks of any appreciable absorbance (at a detection 
wavelength of 214 nM) were collected and subjected to 
sequence analysis. The amino acid sequences determined 
and the peptide residues as designated in SEQ ID N0:1 
were: 

Fraction Amino Acid Sequence Peptide 
Residues 



5 


KRDPCLR 


104-110 


6 


RDPCLR 


105-110 


8 


APSCI CHPG YHGE . . . 


129-141 


10 


DFCIHGECK 


114-122 


11 


KYKDFCI . . . 


111-117 


12 


YKDFCIHGECK 


112-122 


14 


CHGLSL. . . 


143-148 
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17 DLQEADLDLLXV . . . 63 _ 

74 

18 DLQEADLDL. . . 63 _ 71 

The dots indicate that the full extent of the 
peptide sequence was not determined. 

The most amino-terminal fragments obtained were 
initiated at aspartic acid residue 63; however, since 
this residue occurs directly after a potential trypsin 
cleavage site (i.e., carboxy-terminal to the arginine 
residue at position 62) , it was not possible to conclude 
from this data that residue 63 necessarily represents the 
absolute amino-terminus of HB-EHM. The most carboxy- 
terminal tryptic fragment obtained extended at least 
through residue 148 (Leu) of the predicted precursor. 
Thus, the tryptic fragment sequences indicate that a 
mature form of HB-EHM protein exists that extends over at 
least 86 amino acids. 

Further characterization studies were carried out 
on HB-EHM of amino-terminal sequence 

RVXLS SKPQALAXPNKEEHGKRKK (i.e., the form contained in the 
33 minute peak) which is the predominant form in U-937 
cell conditioned medium and therefore most available for 
structural analysis. The isoelectric points of HB-EGF 
and EGF were determined by chromatofocusing (as generally 
described in Fagerstam et al., 266 J. Chromatogr. 523, 
1983) . Briefly, 100 ng of the 33 minute peak form of HB- 
EHM (as described above) was applied to a Mono P column 
after equilibration of the column with 25 mM 
ethanolamine-acetic acid buffer, pH 9.4. The column was 
washed with this buffer for 20 minutes and subsequently 
with 0.1X Polybuffer 96-acetic acid, pH 6.0 (Pharmacia) 
for 40 minutes at a flow rate of l ml/min. For 
comparison, the isoelectric point of human recombinant 
EGF (Collaborative Research, Bedford, MA) was also 
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determined. 100 ng of EGF was applied to a Mono P column 
after equilibration of the column with 25 mM imidazole- 
HC1 buffer, pH 7,4. The column was washed with this 
buffer for 20 minutes and subsequently for 40 minutes 
» with 0.125X Polybuffer 74-HC1 (Pharmacia), pH 4.0 at a 
flow rate of l ml/min. Fractions of l ml were collected 
and 5 Ml of each fraction were tested for growth factor 
activity. The pi of the 33 minute peak form of HB-EHM 
was found to be between 7.2 and 7.8, compared to the pi 
' Of EGF which was 5.3-5.5. 

HB-EHM may be post-translationally modified. 
Recombinant HB-EHM produced in E. coli appears to be 
about 5-6 kDa smaller than native HB-EHM as judged by 15% 
polyacrylamide/SDS gel electrophoresis, and there are two 
threonine residues predicted from the nucleotide sequence 
that did not appear in protein sequencing (suggesting 
that these threonine residues might be sites of post- 
radiational modification) . since threonine is a site 
for O-glycosylation, purified 125 I-HB-EHM (i.e., the 33 
minute peak described above) was produced and tested for 
the presence of O-linked glycosylation by treatment with 
endo-N-acetylgalactosaminidase (O-glycanase) as follows. 

Two nq of HB-EHM were radiolabeled with Na 125 I 
(200 nd/2 Ml; ICN, Costa Mesa, CA) using Iodo-Beads 
(Pierce, Rockford, IL) by the method described in 125 
Analytical Biochem. 427, 1982. Briefly, the iodo-beads 
were washed with 50 mM Tris-HCl (pH 7.4), dried, and 
added to a microfuge tube containing 200 nl 50 mM Tris- 
HCl (pH 7.4) and 200 /*Ci 125 I. After a 5 minute 
incubation HB-EHM (2 ng in 50 Ml Tris-HCl, pH 7.4) was 
added. After a radioiodination period of 15 minutes, the 
reaction mixture was applied to a small heparin-Sepharose 
column (50 Ml) equilibrated with 50 mM Tris-HCl (pH 7.4), 
0.5% BSA, 200 mM NaCl, 10 mM KI. The column was washed 
extensively with this buffer and 125 I-HB-EHM was eluted 
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with the equilibration buffer containing 2M NaCl. The 
specific activity of 125 I-HB-EHM was 22,500 CPM/ng. For 
the removal of o-linked oligosaccharide chains from l25 i- 
HB-EHM, the 125 I-HB-EHM (0.5 ng in 10 Ml SO mM Na 
5 Cacodylate, pH 6.0, 25 mM CaCl 2 , 0.1% SDS and 10 mM DTT) 
was boiled for 5 minutes. The reaction mixture was 
adjusted to 1% Triton X-100 and 2 mM 

phenylmethylsulf onylf luoride (PMSF) , and the sample was 
digested first with neuraminidase (0.01 units, 
io Calbiochem) for 60 minutes at 37 »C and then with endo-N- 
acetylgalactosaminidase (0.25 milliunits; Genzyme, 
Cambridge, MA) at 37 °C overnight. The sample was 
analyzed by SDS/polyacrylamide gel electrophoresis. The 
polyacrylamide gel was dried and exposed to Kodak X-Omat 
15 autoradiography film for 12 hours at -70 °C. 

Treatment with O-glycanase lowered the apparent 
molecular weight of HB-EHM from 18-20 kDa to about 14-16 
kDa (as judged by polyacrylamide gel electrophoreses) 
suggesting that this polypeptide was modified extensively 
by o-linked glycosylation. 

Cloning 

To isolate clones encoding HB-EHM, a cDNA library 
constructed from the mRNA of the human histiocytic 
lymphoma cell line U-937 was screened with an 
oligonucleotide probe, the design of which was based on 
the above described HB-EHM amino acid sequence [SEQ ID 
NO: 3: VXLSSKPQALAXPNKEEHGK] . This library, purchased 
from Clontech Laboratories (Palo Alto, CA) , was produced 
using mRNA from U-937 cells differentiated by addition of 
phorbol myristate acetate (PMA) at 50 ng/ml for 3.5 days; 
cDNA synthesized from the mRNA was cloned into the EcoRl 
site of cloning vector AgtlO. The number of independent 
recombinants was reported as 1.4 X 10 6 . An aliquot of 
the library was introduced into host cells [i.e., e. coli 
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strain NM538, Ji§dB(r k "m k + ) supF (Frischauf et al., 170 J. 
Mol. Biol. 827, 1983, hereby incorporated by reference)] 
by standard techniques, and the resultant plaques were 
immobilized on Hybond-N + nylon membranes (Amersham 
5 corporation, Arlington Heights, IL) and screened, by 
standard techniques, using as a probe, a synthetic 
oligonucleotide based on the amino-terminal amino acid 
sequence originally derived for HB-EHM. This probe was a 
45-residue "codon-choice" oligonucleotide, designed by 
10 assuming that the codons encoding each of the amino acids 
6 through 20 in the HB-EHM gene corresponded to the codon 
most commonly used for that amino acid in human genes 
(excluding the interferons and collagen) reported in the 
literature and databases (The codon frequency table used 
15 in designing the probe was generated at California 
Biotechnology Inc., Mountain View, CA., by Dr. Barry 
Greenberg) . For proline and glycine, two possible codon 
choices were incorporated into the probe to increase the 
likelihood that a correct choice would be present. In 
addition, CAT was used as the histidine codon at position 
18, rather than the more common CAC, in order to 
eliminate the dinucleotide sequence CpG which is often 
disfavored in mammalian genes (see, e.g., Bird, 8 ffucl. 
Acids .Res. 1499, 1980). The resultant 8-fold degenerate 
probe (probe 4955) , synthesized as the antisense of the 
predicted coding region, was of the following sequence 
(SEQ ID NO: 11) : 
Probe 4955: 

5 ' - CTTGCCATGCTCCTCCTTGTTAGGCTTGGCCAGGGCCTGAGGCTT-3 • 
T G G 

For use in screening the U-937 cDNA library 
filters, probe 4955 was 5« end-labelled with ?-[ 32 P]-ATP 
(Amersham Corporation, Arlington Heights, IL) . 
Radioactive-labelling of the probe and filter 
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hybridization were carried out by standard techniques; 
hybridization conditions included 1.7 x 10 8 cpm of probe, 
a hybridization temperature of 42°c, and 200 ml of 
hybridization mixture (20% formamide, 6X SSC, 50mM sodium 
5 phosphate pH 6.8, 100 /ig/ml autoclaved DNA, and 5X 
Denhardt's solution). Following an overnight 
hybridization, filters were rinsed briefly three times 
with IX SSC containing 0.1% sodium dodecyl sulfate (SDS) . 
Filters were then washed twice for 30 minutes with 

10 shaking at room temperature in IX SSC/ 0.1% SDS. To 
remove weakly-hybridizing probe molecules, the filters 
were washed in IX SSC/ 0.1% SDS at 43-46°C for 20 minutes 
in a shaking water bath. Filters were then exposed, at 
below -70 °C, to X-ray film between two intensifying 

15 screens. 

Sixteen strongly-hybridizing plaques were detected 
and were purified using standard techniques. Phage DNA 
from several of these plaques was purified as described 
by Sambrook et al. (Molecular Cloning: A Laboratory 
20 Manual, second edition, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY, 1989) ; restriction digest 
analysis with £ssRI indicated that the various isolated 
phage contained cDNA inserts ranging in size from about 
1.1 tt to 3 kb. 

25 The insert in one such phage, AU2, was chosen for 

complete nucleotide sequence analysis. Fragments of this 
cDNA insert were ligated into appropriately-digested 
M13mp phages according to standard techniques, and were 
sequenced using the Sanger dideoxy method (Sanger et al. , 

30 143 J. Hoi. Biol. 161, 1980). This sequence is presented 
in Fig. 3. 

Referring to Fig. 3 (SEQ ID N0:1), nucleotides 
481 - 540 encode an amino acid sequence that matches the 
initial amino-terminal HB-EHM sequence determined [SEQ ID 
35 NO: 3: VXLSSKPQALAXPNKEEHGK] . The nucleotide sequence 
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predicts a threonine residue at the second and twelfth 
amino acid positions, where no clear residue was detected 
by standard automated protein sequencing techniques in 
any of the HB-EHM samples analyzed, it is therefore 
5 possible that the threonines corresponding to positions 2 
and 12 of the form of HB-EHM originally sequenced are 
modified in mature HB-EHM, possibly by o-linked 
glycosylation. 

The coding sequence for HB-EHM lies in an open 
reading frame extended between the TGA termination codons 
at nucleotides 73-75 and nucleotides 886-888 (Fig. 3, SEQ 
ID NO:l). within this open reading frame the ATG 
(methionine) codon at nucleotides 262-264 represents the 
most likely site of translation initiation, since it is 
the only ATG in the sequence lying 5' to nucleotides 481- 
483 (the codon for the valine which is the first amino 
acid of the amino-terminal sequence originally determined 
for HB-EHM) . The sequence from nucleotides 253 to 265 
(TGCGGGACGATjgA) surrounding this ATG shares homology with 
the consensus sequence for vertebrate translation 
initiation sites (SEQ ID NO: 12) [(GCC) GCCA/GCCATGG } 
Kozak, 15 ffucl. Acids. Res. 8125, 1987], and, in 
particular, contains the highly conserved A residue at 
position -3 from the ATG. In addition, the predicted 
stretch of hydrophobic amino acids encoded immediately 3' 
to this ATG is indicative of a secretion signal sequence 
(von Heijne, 133 .Eur. J. Biochem. 17, 1983) , as would be 
expected for the amino-terminus of the primary 
translation product for a secreted protein like HB-EHM. 
Mature HB-EHM thus appears to be synthesized as part of a 
208-amino acid precursor, encoded by nucleotides 262-885 
in the AU2 HB-EHM CDNA. (Pig. 3, SEQ ID NO:l). 

The amino termini determined from the various 
purified samples of HB-EHM lie at amino acids 63, 73, 74, 
77, and 82 in the predicted translation product shown in 
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Fig. 3 (SEQ ID NO:l). The amino acid residues (i.e., 
residues 20 - 62) lying between these sites and the 
putative hydrophobic secretion signal on the amino 
terminus of the precursor protein appear to represent a 
5 "pro" sequence, cleaved off post-translationally during 
the formation of mature HB-EHM. However, the amino 
terminus of the largest protein species isolated, the 24 
kD form, has not yet been determined due to a blocking 
modification; assuming this form is an HB-EHM derivative, 

10 it may extend amino-terminally to contain some of the 
amino acids lying between residues 19 and 63. No 
differences in activity have yet been noted between the 
various forms of HB-EHM. 

A search of the National Biomedical Research 

15 Foundation (NBRF) protein database, using the FASTA 
program (Pearson and Lipman, 85 Proc. Natl. Acad. Sci. 
USA 2444, 1988, Devereux et al., 12 Nucl. Acids Res. 387, 
1984) indicated homology between HB-EHM and the 
EGF/TGFa/amphiregulin family of proteins, in particular, 

20 between amino acid residues 108 and 143 (Fig. 3, SEQ ID 
' NO:l) where HB-EHM contains all six cysteine residues 
conserved in the EGF (SEQ ID N0:14)/TGFa (SEQ ID 
NO:l5)/amphiregulin (SEQ ID NO: 13) family members (Fig. 
2) . in the region stretching from the first to the sixth 

25 cysteine, HB-EHM conserves 40% (15 of 37) of the residues 
in human EGF, 42% (15 of 36) of the residues in human 
TGFa, and 53% (19 of 36) of the residues in human 
amphiregulin. Another feature shared by several of the 
family members is a transmembrane precursor structure 

30 that is processed amino- and carboxy-terminally to 

release the mature growth factor. The precursor sequence 
of HB-EHM shown in Fig. 3 (SEQ ID N0:1) contains a 
strongly hydrophobic internal domain comprising amino 
acids 161 - 184, which by analogy with the other members 

35 of the EGF/TGFa/amphiregulin family is predicted to be a 
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transmembrane domain. The mature carboxy-terminus of HB- 
EHM is in turn predicted to lie between the last of the 
conserved cysteine residues at amino acid residue 143, 
and the start of the hydrophobic domain at residue 161. 
5 One of the sequenced tryptic fragments of HB-EHM 
consisted of the sequence CHGLS (see above) . This 
sequence corresponds to residues 143 - 147 (see Fig. 3, 
SEQ ID NO:l). since the sequence of this fragment ends 
before a tryptic cleavage site, this result is consistent 
io with the carboxyl terminus of some forms of HB-EHM lying 
at amino acid residue 147. Another sequenced tryptic 
fragment of HB-EHM consisted of the sequence CHGLSL, 
corresponding to residues 143-148 (see Fig. 3, SEQ ID 
N0:l). The sequence data was not sufficient to show that 
15 residue 148 represents the car boxy-terminal end of the 
protein. Indeed, experiments with reduced and 
carboxymethylated synthetic peptides indicated that this 
tryptic fragment eluted from the C 18 RP-HPLC column at 
approximately the same concentration (percent) 
20 acetonitrile as does the synthetic peptide CHGLSLP. 

These results are consistent with residue 149 also being 
a carboxy-terminal end on some mature forms of HB-EHM. 

Other equivalent clones can be isolated by 
hybridization screening techniques well known to those of 
25 ordinary skill in this art. 

Binding pf Heparin Binding egf— Homo i n gous Mitogen to egf 
Receptors on A-a 3 l Cells 

Because HB-EHM is structurally a member of the EGF 
family, it was tested for biological properties 
30 characteristic of EGF, e.g., ability to bind an EGF 
receptor. Competitive binding assays were used to 
measure this interaction and were performed as follows. 
[ 125 I]EGF (2 ng, 1.2 X 10 5 dpm, Collaborative Research, 
Bedford, MA) was added to 24 well plates containing 
35 confluent A-431 cells (Fabricant et al., 74 Proc. Natl. 
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Acad. Sci. USA 565, 1977; Haigler et al., 75 Proc. Natl. 
Acad. Sci. USA 3317, 1978, available from the ATCC, 
Accession No. CRL 1555) . Increasing amounts of HB-EHM or 
recombinant human EGF were then added, and the remainder 
5 of the binding assay was performed as described in Singh 
(147 Meth. Enzymol. 13, 1987, hereby incorporated by 
reference) and Kimball et al. (771 Biochem. Biophys. Acta 
82, 1984, hereby incorporated by reference). Competitive 
[ 125 I]EGF binding to bovine aortic smooth muscle cells 

10 (BASMC) was measured as described above for A-431 cells, 
except that BASMC were used and plated in 6-well plates. 
Purified HB-EHM was found to bind to EGF receptors on A- 
431 cells and SMC. It inhibited essentially 100% of the 
binding of [ 125 I]EGF to A-431 cells as did EGF. HB-EHM 

15 had a greater affinity than EGF for EGF receptors on 
BASMC; HB-EHM showed a 50% inhibition of [ 125 I]EGF 
binding at 63 pg/ml (2.9 pM) compared to EGF which showed 
a 50% inhibition at 290 pg/ml (48 pM) . 

Reflecting these different affinities for the EGF 

20 receptors on BASMC, HB-EHM was found to be a more potent 
BASMC mitogen than EGF by the following assay. BASMC 
were plated in DMEM, 10% calf serum, 100 U/ml penicillin, 
100 fig/ml streptomycin sulfate at 5 x 10 3 cells/well in a 
24 well plate, and, after attachment of the cells (e.g., 

25 following an overnight incubation) , the media was 

replaced with DMEM, 1% calf serum, and penicillin and 
streptomycin as above. HB-EHM, recombinant human EGF 
(Creative Biomolecules , Hopkinton, MA) , or recombinant 
PDGF (Creative Biomolecules, Hopkinton, MA) was then 

30 added to the wells. Cells were counted after three days. 
The activity of HB-EHM on BASMC proliferation was more 
comparable to that of PDGF than to that of EGF. HB-EHM 
at 100 pg/ml, PDGF at 500 pg/ml, and EGF at 4 ng/ml 
similarly stimulated BASMC proliferation (i.e., a 2.5- 

35 fold increase) . 
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Besides inhibiting the binding of 125 I-EGF to the 
EGF receptor, HB-EHM (i.e., the 33 minute peak form 
described above) also triggered autophosphorylation of 
the EGF receptor. This was shown by plating A-431 cells 
5 in 6-well plates and culturing in Dulbecco's Modified 
Eagle's Medium (OMEM) containing 10% fetal bovine serum 
and antibiotics. Confluent monolayers were incubated 
with 5 ng/ml EGF (Collaborative Research, Waltham, MA) or 
5 ng/ml HB-EHM. After 15 minutes, the cells were washed 

10 with ice-cold phosphate-buffered saline containing 0.4 mM 
EDTA, 10 mM Na-fluorate, 10 mM Na-pyrophosphate and 0.4 
mM Na-orthovanadate, scraped and lysed in 100 jil of 
PI/RIPA buffer [1% NP-40 (Pierce, Rockford, IL) , 1% 
deoxycholate (Aldrich, Milwaukee, WI) , 0.1% SDS, 1% 

15 aprotinin (Boehringer-Mannheim, Indianapolis, IN), 1 mM 
PMSF (Pierce) , 2 mM EDTA (Sigma) , 10 mM Na-fluorate, 
(Sigma, St. Louis, MO), 10 mM Na-pyrophosphate (Baker, 
Phillipsburg, NJ) , 0.4 mM Na-orthovanadate, (Aldrich), 10 
mM iodoacetamide (Aldrich) in phosphate buffered saline] . 

20 Thirty pi of supernatant clarified by spinning for a 5 
minute period in a microfuge was analyzed by reducing 
SDS/polyacrylamide gel electrophoresis (6% polyacrylamide 
gel) . After transfer to a nitrocellulose membrane 
(Schleicher and Schuell, Keen, NH) , EGF receptor 

25 phosphorylated on tyrosine residues was detected by 
Western blot analysis using anti-phosphotyrosine 
antibodies (PY-20, ICN Biomedicals, Costa Mesa, CA) and 
subsequent development with alkaline phosphatase- 
conjugated rabbit anti-mouse IgG antibodies (Promega, 

30 Madison, WI) as previously described (Wada et al., 61 
Cell 1339, 1990). A standard of phosphorylated EGF 
receptor was purchased from UBI (Lake Placid, NY) . 

HB-EHM stimulated phosphorylation of a 170 kDa 
protein that co-migrated with an EGF receptor standard 
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and which was also phosphorylated when cells were 
stimulated by EGF. 

Mammalian Bypression Vgr^r» rf 

For expression of HB-EHM in mammalian cells, the 
5 HB-EHM coding region from cDNA clone AU2 was inserted 
into two different expression vectors, pMTN and pAXneoR. 

The vector pMTN consists of (i) a flisdlli fragment 
of SV40 that spans the enhancer region of this virus, 
(ii) a Hindlll - gamHI fragment spanning the promoter 
10 region of the human metallothionein II a gene, (iii) a 
SamHl site for insertion of the coding region to be 
expressed, (iv) a £amHi - EcoRi fragment spanning the 3' 
untranslated region of the human growth hormone gene (to 
provide a polyadenylation signal) , (v) an EsqRI - Hindlll 
15 fragment containing all of the bacterial plasmid puce 
(except for a portion of the polylinker) to provide a 
bacterial origin of replication and the ampicillin 
resistance gene, and (vi) a fragment containing the SV40 
origin of replication, and encoding resistance to 
20 neomycin and its analog, G418. This vector was 

constructed by first digesting the vector pMTpn (Greene 
et al., 231 Science 1150, 1986, hereby incorporated by 
reference) with §mX, adding fiajgHI linkers, digesting 
with 23JBHI, and religating the plasmid, to create the 
25 vector pMTSV40polyA-Bam. Two fragments were then 

isolated from the vector pSV2-neo (Southern and Berg, 1 
J. Jfol. Appl. Genet. 327, 1982, hereby incorporated by 
reference) : the Evu.il - HiadHI fragment containing the 
SV40 early region promoter; and the Hindlll - ggmHi 
30 fragment containing the neomycin resistance coding 

region. These two fragments were ligated together, and 
then treated with Klenow-fragment DNA polymerase I to 
produce blunt ends. The resulting DNA fragment was then 
ligated into the vector pMTSV40polyA-Bam, which had been 
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partially digested with HindHI and treated with Klenow- 
fragment DMA polymerase I to produce blunt ends. 

The vector pAXneoR consists of: (i) the 4.3 kb 
£S2RI - AJLul fragment of the human j8-actin gene isolate 
5 pl4T/?-17 containing the /J-actin gene promoter (Leavitt et 
al., 4 Mol. cell. Biol. 1961, 1984, hereby incorporated 
by reference; Ng et al., 5 Mol. Cell. Biol. 2720, 1985, 
hereby incorporated by reference) ; (ii) a short 
polylinker region for insertion of the coding region to 
10 be expressed; (iii) a 2.3 kb fragment, derived from the 
plasmid pcDVl (Okayama and Berg, 3 Mol. Cell. Biol. 280, 
1983, hereby incorporated by reference), containing the 
late region polyadenylation signal from the SV40 virus, 
as well as the pBR322 ampicillin resistance gene and 
15 bacterial origin of replication; and (iv) the 3.4 kb 

EyjiII - EeoRI fragment from pSV2-neo (Southern and Berg, 
1 J. Mol. Appl. Genet. 327, 1982, hereby incorporated by 
reference) containing the SV40 early promoter region (and 
origin of replication) linked to the bacterial neomycin 
resistance coding region. 

For insertion into the expression vectors, the 
cDNA insert in clone AU2 was digested with Tagl and XjQnl, 
releasing a fragment extending from nucleotide 219 to 970 
(SEQ ID NO:l), and treated with Klenow-fragment DNA 
polymerase I to produce blunt ends. BamH l linkers were 
ligated to the resulting fragments. After digestion with 
fiamHI, the ligation products were fractionated on an 
agarose gel, and the 761 bp linkered fragment (SEQ ID 
NO:l) spanning the HB-EHM coding region was isolated. 
Ligation of this fragment into the BjfflHI sites of the 
vectors pMTN and pAXneoR (in the orientation such that 
the HB-EHM coding region was operably linked to the 
metallothionein or actin promoter) resulted in the 
plasmids pMTN-HBEGF and pAX-HBEGF. The fragment of DNA 
containing the HB-EHM coding region may be isolated from 
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these plasmids by digesting either pMTN-HBEGF or pAX- 
HBEGF with Bam Hl and isolating the 761 bp fragment. This 
fragment can then be inserted into any appropriate 
expression vector. 
5 The cDNA expression plasmids, pMTN-HBEGF and pAX- 

HBEGF have been deposited with the American Type Culture 
Collection and they, respectively, bear the accession 
numbers: ATCC No. 40900 and No. 40899. Applicant's 
assignee. The Children's Medical Center Corporation, 

io acknowledge their responsibility to replace these 

cultures should they die before the end of the term of a 
patent issued hereon, and their responsibility to notify 
the ATCC of the issuance of such a patent, at which time 
the deposits will be made available to the public. Until 

15 that time, the deposits will be made available to the 
Commissioner of Patents under the terms of 37 CFR §1.14 
and 35 USC §112. 

Host cells used for the recombinant expression of 
HB-EHM were Chinese Hamster Ovary (CHO-K1) cells, 

20 obtained from the American Type Culture Collection 

(Accession No. CCL 61) . These cells were grown at 37°C 
in a 5% C0 2 humidified incubator in CHO growth medium 
[1:1 mixture of Dulbecco's Modified Eagle's Medium (DMEM- 
21) : Coon's F12, supplemented with 10% fetal bovine 

25 serum, 2 mM L-glutamine, 50 units/ml of penicillin, and 
50 fig/ml of streptomycin]. Approximately 24 hours prior 
to transfection with the HB-EHM expression plasmids, CHO- 
Kl cells were plated in 10-cm tissue culture dishes at a 
density of 2 - 3 X 10 s cells/dish. Plasmid DNA (20 /zg) 

30 was introduced into the CHO cells using the calcium 

phosphate precipitation method described in Wigler et al. 
(16 Cell 111, 1979) . Four to five hours after DNA 
addition, the medium was aspirated from each dish and 
cells were "shocked" by treatment for one minute with 

35 3 ml of 15% glycerol in HEPES-buffered saline. The 
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glycerol was removed, the cells in each dish were rinsed 
with 8 ml of serum-free medium (CHO growth medium lacking 
10% fetal bovine serum) , and 8 ml of CHO growth medium 
was added to each plate. Cells were incubated for 24 
5 hours. 

For "transient" expression experiments, the medium 
was replaced, after the 24 hour incubation, with 5 ml of 
serum-free medium. In the case of cells transfected with 
pMTN-HBEGP or the pMTN parental plasmid, the serum-free 

10 medium was supplemented with 50 mm ZnS0 4 and l |iM 

dexamethasone. The cells were incubated for 36 hours. 
The medium was then collected and assayed for the 
presence of HB-EHM, using the [ 3 H] -thymidine uptake assay 
in Balb/c mouse 3T3 cells described above. In 

15 preliminary experiments, conditioned medium from CHO-K1 
cells transiently transfected with pAX-HBEGF exhibited 20 
U/ml of stimulatory activity; cells transfected with the 
control plasmid pAXneoR yielded 10 U/ml of activity. 

For the selection of stable pools of transfected 

20 cells, G418 (Geneticin; GIBCO, Grand Island, NY) was 

added to the dishes after the 24 hour incubation period. 
The final concentration of the G418 was 1.0 mg/ml. 
Selection for G418-resistant cells extended over a period 
of about 2 weeks. Cells were split as necessary and re- 

25 fed with CHO growth medium containing G418. Following 
establishment of stable pools, cells were switched to 
serum-free medium (containing ZnS0 4 and dexamethasone in 
the case of pMTN- and pMTN-HBEGF-transfected cells as 
described above), and incubated for 24-48 hours. The 

30 conditioned medium was collected and assayed for HB-EHM 
activity using the [ 3 H] -thymidine uptake assay described 
above. 

Prokarvotic Fvpression Vectors 

If one desires to produce HB-EHM that is not 
35 glycosylated, a DNA sequence encoding mature HB-EHM can 
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be expressed in a prokaryotic host cell. DNA encoding 
for mature HB-EHM is carried on a vector operably linked 
to control signals capable of effecting expression in the 
prokaryotic host. If desired, the coding sequence may 
5 contain, at its 5' end, a sequence encoding any of the 
known signal sequences capable of effecting secretion of 
the expressed protein into the periplasmic space of the 
host cell, thereby faciliating recovery of the protein. 
Prokaryotes most frequently used are represented by 
10 various strains of E. coli r however, other microbial 

strains may also be used. Plasmid vectors are used which 
contain replication origins, selectable markers, and 
control sequences derived from a species compatible with 
the microbial host. For example, E. coli may be 
15 transformed using derivatives of pBR322, a plasmid 
constructed by Bolivar et al. (2 Gene 95, 1977) using 
fragments derived from three naturally-occurring 
plasmids, two isolated from species of Salmonella . and 
one isolated from E. coli . pBR322 contains genes for 
ampicillin and tetracycline resistance, and thus provides 
multiple selectable markers which can be either retained 
or destroyed in constructing the desired expression 
vector. Commonly used prokaryotic control sequences 
(also referred to as "regulatory elements") are defined 
herein to include promoters for transcription initiation, 
optionally with an operator, along with ribosome binding 
site sequences. Promoters commonly used to direct 
protein expression include the beta-lactamase 
(penicillinase), the lactose (lac) (Chang et al., 198 
mature 1056, 1977) and the tryptophan (trp) promoter 
systems (Goeddel et al., 8 Nucl. Acids Res. 4057, 1980) 
as well as the lambda-derived P t promoter and N-gene 
ribosome binding site (Shimatake et al., 292 Nature 128, 
1981) . 
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Successful production of HB-EHM in E. coll has 
been achieved in two ways; through expression of the 
factor from the plasmid pNA28 as a fusion protein with a 
modified form of the chloramphenicol acetyltransferase 
5 (CAT) protein, and through expression of the factor from 
the plasmid pNA51 as an HB-EHM protein containing an 
added methionine residue at the amino-terminus (termed, 
Met-HB-EHM) . in the case of expression using the vector 
pNA28, the fusion protein produced was cleaved with 
10 cyanogen bromide to release the desired HB-EHM product. 
In the case of expression using the vector pNA5l r the 
Met-HB-EHM product can be used either without further 
truncation or following treatment with cyanogen bromide 
to remove the amino-terminal methionine. Various e. coli 
15 strains can be used as hosts for pNA28 and pNA51; 

preferred hosts include E. coj.1 strain W3110 (American 
Type Culture Collection Accession Number 27325; ATCC, 
Rockville, MD) for pNA28 and E. coli strain B (American 
Type Culture collection Accession Number 23848; ATCC, 
20 Rockville, MD) for pNA51. 

The vector pNA28 consists of (i) an origin of 
replication functional in E. coll derived from the 
plasmid pBR322; (ii) a selectable tetracycline resistance 
gene also derived from pBR322; (iii) the transcription 
25 termination region of the E. coli trp operon (placed at 
the end of the tetracycline resistance gene to prevent 
transcriptional read-through into the £ee promoter 
region) ; (iv) the E. coll £rja operon promoter, used to 
drive expression of the CAT - HB-EHM fusion protein; (v) 
30 the CAT - HB-EHM fusion protein coding sequence; and (vi) 
the T1T2 transcription terminators from the ribosomal RNA 
(EEnB) locus of E. coli (placed at the end of the CAT - 
HB-EHM coding region) . The CAT portion of the fusion 
protein encoded by pNA28 is a truncated, modified version 
35 of chloramphenicol acetyltransferase; the nucleic acid 
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and amino acid sequence of this CAT protein is shown in 
Fig. 5 (SEQ ID NO: 17, 18) (amino acid residues 1-73). 
Amino acid residue 74 of the fusion is a methionine and 
represents a site for cleavage by cyanogen bromide. 
5 Residues 75-149 of the fusion protein correspond to 
residues 73-147 of HB-EHM (see Pig. 3; SEQ ID NO: 1). 

The vector pNA51 is identical in structure to 
pNA28, except that the nucleotide sequence encoding the 
CAT - HB-EHM fusion protein has been replaced by a 
io sequence encoding a methionine amino acid fused to 
residues 73-147 of HB-EHM (see Fig. 3, SEQ ID NO:l). 

E* CPli strains W3110 and B bearing the expression 
plasmids pNA28 and pNA5l, respectively, have been 
deposited with the American Type Culture Collection 
15 (Rockville, MD) and they, respectively, bear the 

accession numbers: ATCC No. and No. . 

Applicant's assignee, The Children's Medical Center 
Corporation, acknowledge their responsibility to replace 
these cultures should they die before the end of the term 
20 of a patent issued hereon, and their responsibility to 
' notify the ATCC of the issuance of such a patent, at 
which time the deposits will be made available to the 
public. Until that time, the deposits will be made 
available to the Commissioner of Patents under the terms 
25 of 37 CFR §1.14 and 35 USC §112. 

For expression of CAT - HB-EHM, a single colony of 
E t coU strain W3110 containing pNA28 was first used to 
inoculate 100 ml of supplemented minimal medium (i.e., M9 
salts supplemented with 0.4% glucose, 2/xg/ml thiamine, 1% 
30 casamino acids, O.imM CaCl 2 , O.BmM MgS0 4 , and 6.25 nq/ml 
tetracycline) containing 40 /xg/ml tryptophan. The 100- 
ml culture was grown overnight with shaking at 37 °C, and 
aliguots of this culture (io ml per liter) were then used 
to inoculate one-liter batches of supplemented minimal 
35 medium containing 4 /*g/ml tryptophan. Usually, eight 
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one-liter batches were inoculated together on a given 
day. Each one-liter culture was incubated at 37°C with 
shaking in a four-liter triple-baffled Erlenmeyer flask 
until an optical density (OD 600 ) of 0.5 - 0.7 was reached, 

5 at which point, 30 mg of 3/?-indoleacrylic acid (Sigma) in 
10 ml of ethanol was added to the flask to induce the trp 
promoter. The cultures were then incubated overnight 
with shaking at 37 °C, and the cells were harvested by 
centrifugation at 4500 rpm in a Sorvall RC-3B centrifuge 

o for 20 minutes. The cell pellets from one to eight one- 
liter batches were resuspended in water or TE buffer 
(20mM Tris-HCl, pH 7.5, 5mM EDTA) , combined, and 
centrifuged to repellet the cells. The resulting washed 
pellet was either used directly for protein purification 

5 or was frozen at -20° for storage prior to cell lysis. 

When CAT - HB-EHM was expressed as described 
above, the fusion protein was localized into inclusion 
bodies within the cells. To isolate the inclusion 
bodies, the fresh or frozen cell pellet obtained as 

o described above was resuspended in TED buffer (20mM Tris- 
HCl, pH 7.5, 5mM EDTA, ImM DTT) , placed on ice, and 
sonicated using a Heat Systems Ultrasonics sonicator 
(power level of 7; 50% pulsed cycle; 0.5 inch diameter 
probe; Farmingdale, NY) . The cells were ruptured using 

5 2-5 two-minute sonication intervals, with a two-minute 
cooling period between each interval. The resulting 
suspension was mixed with 6M guanidine until a final 
concentration of 1M guanidine was reached and was then 
centrifuged for 10 minutes at 16,000 x g (10,000 rpm, 6SA 

o rotor) to pellet the inclusion bodies. The pellet was 
washed by resuspension in TED buffer containing 1M 
guanidine, followed by re-centrifugation at 16,000 x g 
for 10 minutes. 

To cleave HB-EHM away from the CAT portion of the 

5 fusion protein, the inclusion bodies were first 
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resuspended in 70% formic acid (8 - 10 ml per gram wet 
weight of inclusion bodies) . Cyanogen bromide (50 mg per 
gram wet weight of inclusion bodies) was then added, and 
the resulting solution was overlaid with argon and 
» stirred at room temperature for four hours. The solution 
was dried under vacuum, resuspended in 20 mM Tris-HCl, pH 
7.5, 5mM EDTA, 6M urea, and then centrifuged for 10 
minutes at 16,000 x g. The supernatant was retained, and 
the pellet was resuspended a second time in 20mM Tris- 
' HC1, pH 7.5, 5mM EDTA, 6M urea. The resuspension was 
centrifuged as above at 16,000 x g for 10 minutes, and 
the resulting supernatant was combined with the first 
retained supernatant. 

For purification of HB-EHM from the cyanogen 
bromide cleavage mix, the combined supernatants were 
loaded onto a column of SP-Sephadex C25 (2.5cm x 2cm, 
Pharmacia) that had been pre-equilibrated with 20mM Tris- 
HC1, p H 7.5, 5mM EDTA, 6M urea, 0.1M NaCl. After loading 
was complete, the column was washed with the 
equilibration buffer until the absorbance of the eluate 
reached baseline. Bound proteins were then eluted from 
the column with 20mM Tris-HCl, pH 7.5, 5mM EDTA, 6M urea, 
0.6M Naci. At this point, the denatured protein in the 
eluate was allowed to refold by (i) adding glutathione 
(Boehringer-Mannheim) and glutathione disulfide 
(Boehringer-Mannheim) to concentrations of 6 mM and 1.2 
mM, respectively, (2) diluting the resulting solution 
with five volumes of 20mM Tris-HCl, pH 8.8, 5mM EDTA, and 
(3) letting the solution stand at 4«C for between 4 and 
24 hours. Any precipitate forming during this time was 
removed by centrifugation, and the protein was then 
loaded onto a heparin-Sepharose column (2cm x 2.5cm, 
Pharmacia) equilibrated with 20mM Tris-HCl, pH 7.5, 5mM 
EDTA, 0.1M NaCl. After extensive washing with the 
equilibration buffer, the bound protein was step-eluted 
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first with 20mM Tris-HCl, pH 7.5, 5mM EDTA, 0.6M NaCl, 
and then with 20mM Tris-HCl, pH 7.5, 5mM EDTA, 2M NaCl. 
Final purification of the recombinant HB-EHM was 
accomplished by loading the 2M elute from the heparin- 
5 Sepharose column onto a Vydac c 4 reversed-phase column 
(lcra x 25cm) equilibrated with 15% acetonitrile in 0.1% 
trifluoroacetic acid, and then eluting the bound protein 
with a 40-minute gradient of 15% to 35% acetonitrile in 
0.1% trifluoroacetic acid. Elution of the HB-EHM was 

10 monitored by measuring absorbance of the eluate at 214nm 
and by SDS/polyacrylamide gel electrophoresis of aliquots 
of collected fractions. The HB-EHM was dried under 
vacuum, resuspended in phosphate-buffered saline, and 
shown to be active in the BALB/c 3T3 [ 3 H] -thymidine 

15 uptake assay. 

For expression of Met-HB-EHM, E. coli strain B 
cells harboring pNA51 were used to inoculate M9 salts 
supplemented with 0.4% glucose, 1.1% casamino acids, 2 mM 
Mg 2 S0 4 , and 6.25 /ig/ml tetracycline. The culture was 

20 grown with shaking at 30°C until the OD 600 of the culture 
reached approximately 0.45, at which point 3/3- 
indoleacrylic acid was added to a final concentration of 
30 /Kf/ml to induce the trp promoter. The culture was 
incubated at 30 °C with shaking for 4 hours, and half of 

25 the culture was harvested by centr if ugation . The other 
half of the culture was shaken at 30°C for an additional 
period of about 16 hours (about 20 hours total incubation 
time after the addition of 30-indoleacrylic acid) before 
being harvested by centrif ugation. The cell pellets from 

30 the centrifugations were each resuspended to an 

approximate protein concentration of 10 mg/ml in 0.1M 
Tris-HCl, pH 8.8, 5mM EDTA and were then sonicated to 
lyse the cells. The resulting solutions were centr if uged 
to remove cell debris, and the supernatants were 

35 retained. Since Met-HB-EHM expression was detectable by 
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polyacrylamide gel electrophoresis in lysates of cells 
collected either 4 hours or 20 hours after 30- 
indoleacrylic acid induction, the supernatants from these 
two time points were combined. The resulting supernatant 
solution was loaded onto a heparin-Sepharose column 
equilibrated with 50mM Tris-HCl, pH 7.5, 5mM EDTA, 0.1M 
NaCl. The column was washed with 50mM Tris-HCl, pH 7.5, 
5mM EDTA, 0.6M NaCl until the eluate absorbance was 
reduced to background, at which point, the bound protein 
was eluted with 50mM Tris-HCl, pH 7.5, 5mM EDTA, 1.2M 
NaCl. The eluted protein was dialyzed overnight at 4°C 
against 50 mM Tris-HCl, pH7.5, 5 mH EDTA, and then 
fractionated on a Vydac C 4 RP-HPLC column (0.46cm x 25cm) 
using an elution gradient of 10% - 40% acetonitrile in 
0.1% trifluoroacetic acid. The major peak of protein 
eluting from the column (as monitored by absorbance at 
220nm) was collected and shown both to have the amino- 
terminal amino acid sequence expected for Met-HB-EHM, and 
to be active in the BALB/c 3T3 [ 3 H] -thymidine 
incorporation assay. 
Use 

Growth factors of this invention are useful for 
enhancing the healing of wounds. A wound healing amount 
of a growth factor is readily determined by one of 
ordinary skill in the art using standard techniques, and 
such an amount is applied to the wound by standard 
technique. Such growth factors can be used to treat many 
types of chronic non-healing wounds such as full- 
thickness dermal ulcers, for example, pressure sores, 
venous ulcers, and diabetic ulcers; to treat acute wounds 
such as burns, incisions, and injuries; and to speed the 
healing of wounds associated with reconstructive 
procedures such as skin grafting and flap placement used, 
for example, to repair wounds and for cosmetic purposes. 
In addition, such growth factors can be used to treat 
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damage to the gastric epithelium, the lung epithelium, 
and other internal epithelial layers. For example, such 
growth factors, being resistant to extremes of pH, can be 
used to heal ulcers of the esophagus, stomach, duodenum, 
5 and intestine. Also, because of their epithelial cell- 
stimulatory activity, such growth factors can be used to 
treat eye injuries, for example, corneal ulcers and 
abrasions . 

In cases where the growth factors of this 
10 invention are being used for surface wound healing, they 
may be administered by topical means. In these cases, 
they will be applied directly to the site of injury as 
either a solution, spray, gel, cream, ointment or as a 
dry powder. Slow release devices directing these growth 
15 factors to the injured site will also be used as will the 
combination of such growth factors with topical bandages, 
or dressings, or sutures/staples, and with topical creams 
and ointments, such as the antibacterial Silvadene 
(Marion Labs, Kansas City, MO) , commonly used for the 
20 treatment of injuries. 

For topical use, the growth factors of this 
invention will be used at a concentration ranging from 
50-10,000 ng/ml either in a single application, or in 
dosing regimes that range from several times per day to 
25 once every few days for a period of one to several weeks. 
Usually, the amount of topical formulation administered 
is that which is sufficient to apply about 0.01 to 100 
Mg/cm 2 of growth factor per surface area of the wound. 
For application to injuries of the gastro-intestinal 
30 tract, these growth factors will be administered orally 
or rectally in a suitably buffered solution that may 
contain a carrier compound to protect the protein from 
the acid pH and high protease levels encountered in the 
gastric tract. In addition, these growth factors may be 
35 infused via catheter directly to the site of an internal 
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injury. For opthalmic applications, such growth factors 
may be used in an eye drop form or in an ointment. For 
lung injury, the factor may be administered by inhalation 
of a spray or aerosol. 
5 Growth factors of this invention can also be used 

for the in Vitro culturing of responsive cell types, for 
example, fibroblasts, smooth muscle cells, or epithelial 
cells. For such uses, the growth factors can be added to 
the cell culture medium at a concentration from 10 pg/ml 
io to 100 ng/ml. in addition, cells grown under growth 
factor stimulation can be used as a source of expanded 
cell populations for grafting purposes. For all of these 
applications, the growth factors of this invention may be 
used alone or in combination with other growth factors 
15 and biologically active agents. 

Growth factors of this invention can be used in 
standard protocols for production of polyclonal or 
monoclonal antibodies to those growth factors. Such 
monoclonal antibodies are useful in this invention for 
20 detection of growth factors within the serum of a 

patient. The amount of growth factor detected by such a 
procedure can be compared to the normal level of such 
growth factors in equivalent patients. An elevated level 
of such growth factors (e.g., two or three fold 
25 elevation) is indicative of that patient suffering from 
atherosclerosis. Such patients which suffer from 
atherosclerosis because of an elevated level of growth 
factor can be treated by reducing the level of growth 
factor either by adding antibodies to the growth factor 
30 directly to the serum of the patient or by release of 
such antibodies from a slow release device implanted, 
e.g., in the affected blood vessel (s). Alternatively, 
the level of activity or expression of the growth factor 
may be reduced by addition of antagonists or by use of an 
35 antisense RNA molecule (e.g., a 20 nucleotide sequence 
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complementary to the HB-EHM mRNA in the region spanning 
the translational start site) incorporated into a 
biocompatible sustained-release polymeric device. 

HB-EHM-induced proliferation of target cells 
5 (e.g., fibroblasts, epithelial cells and smooth muscle 
cells) provides an assay for screening candidate 
antagonists for ones which block the growth-stimulatory 
activity of a growth factor of this invention. For 
example, candidate antagonists would be added to cultured 

10 cells together with a growth factor of this invention and 
cell stimulation would be measured by [ 3 H] -thymidine 
incorporation into cellular DNA (as described above) or 
by counting the number of cells after an appropriate 
period of incubation (as described above) . This 

15 measurement would be compared to a control cell sample 
incubated under identical conditions, but to which no 
candidate antagonist was added. Useful antagonists would 
be defined as those molecules which inhibit (i.e., 
decrease the amount of cell proliferation induced by 

20 growth factor stimulation) . These antagonists, once 

identified, would be used to inhibit the in vivo activity 
of a growth factor of this invention. Antagonists may be 
useful, for example, in the treatment of atherosclerosis, 
for blocking post-angioplasty proliferation, or for the 

25 treatment of certain neoplasias or myelofibrosis. The 
antagonist, in this case, could be added directly to the 
blood of the patient, could be incorporated into a slow- 
release device, or if possible, could be administered 
topically to the affected area. Candidate antagonists 

30 may be chosen from any source and would include, without 
limitation, antibodies which preferentially bind to HB- 
EHM or its receptor, peptide fragments of HB-EHM, or 
identified or as yet unidentified drugs. 

In addition, certain tumor cell types may be found 
35 to synthesize the growth factors of this invention. If 
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so, the presence of HB-EHM mRNA or protein may be used as 
a diagnostic for the presence and/ or expansion of certain 
tumor types. Immunoassays (e.g., Western blot or ELISA, 
using an antibody which recognizes a growth factor of 
5 this invention), mRNA-based assays (e.g., Northern blot, 
using as a probe, a nucleic acid encoding a growth factor 
of this invention) , and other assays such as an EGF- 
receptor binding assay would be useful for diagnosing the 
presence of an HB-EHM-producing tumor and for monitoring 
10 its regression upon therapy. 

Other embodiments are within the following claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION : 

5 

(i) APPLICANT: Klagsbrun, Michael 
Abraham, Judith A. 
Higashiyama, Shigeki 
Besner, Gail F. 

10 

(ii) TITLE OF INVENTION: HEPARIN BINDING MITOGEN WITH 
HOMOLOGY TO EGF 

(iii) NUMBER OF SEQUENCES: 18 

15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson 

(B) STREET: 225 Franklin street 

(C) CITY: Boston 

20 (D) STATE: Massachusetts 

(E) COUNTRY: US 

(F) ZIP: 02110-2804 

(V) COMPUTER READABLE FORM: 
25 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.24 

30 (Vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(Vii) PRIOR APPLICATION DATA: 
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(A) APPLICATION NUMBER: 07/598,082 

(B) FILING DATE: 16-OCT-1990 

(Viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Freeman, John W. 

(B) REGISTRATION NUMBER: 29066 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 617-542-5070 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 262.. 885 
(D) OTHER INFORMATION: 
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(xi) SEQUENCE DESCRIPTION: SEQ 10 NO:l: 

GCTACGCGGG CCACGCTGCT GGCTGGCCTG ACCTAGGCGC QCGGGSTC6G GCGGCCGCGC 60 

GGGCGGGCTG AGTGAGCAAG ACAAGACACT CAAGAAGAGC GAGCTGCGCC TGGGTCCCGG 120 

CCAGGCTTGC ACGCAGAGGC GGGCGGCAGA CGGTGCCCGG CGGAATCTCC TGAGCTCCGC 180 

CGCCCAGCTC TGGTGCCAGC GCCCAGTGGC CGCCGCTTCG AAAGTGACTG GTGCCTCGCC 240 

GCCTCCTCTC GGTGCGGGAC C ATG AAG CTG CTG CCG TCG GTG GTG CTG AAG 291 
Met Lys Leu Leu Pro Ser Val Val Leu Lys 
1 5 10 

CTC TTT CTG GCT GCA GTT CTC TCG GCA CTG GTG ACT GGC GAG AGC CTG 339 
Leu Phe Leu Ala Ala Val Leu Ser Ala Leu Val Thr Gly Glu Ser Leu 
15 20 25 

GAG CGG CTT CGG AGA GGG CTA GCT GCT GGA ACC AGC AAC CCG GAC CCT 387 
Glu Arg Leu Arg Arg Gly Leu Ala Ala Gly Thr Ser Asn Pro Asp Pro 
30 35 40 

CCC ACT GTA TCC ACG GAC CAG CTG CTA CCC CTA GGA GGC GGC CGG GAC 435 
Pro Thr Val Ser Thr Asp Gin Leu Leu Pro Leu Gly Gly Gly Arg Asp 
45 50 55 

CGG AAA GTC CGT GAC TTG CAA GAG GCA GAT CTG GAC CTT TTG AGA GTC 483 
Arg Lys Val Arg Asp Leu Gin Glu Ala Asp Leu Asp Leu Leu Arg Val 
60 65 70 

ACT TTA TCC TCC AAG CCA CAA GCA CTG GCC ACA CCA AAC AAG GAG GAG 531 
Thr Leu Ser Ser Lys Pro Gin Ala Leu Ala Thr Pro Asn Lys Glu Glu 
75 80 85 90 
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CAC GGG AAA AGA AAO AAO AAA G6C AAG GGG CTA GGG AAG AAG AGO GAC 579 
His Gly Lys Arg Lya Lys Lys Gly -Lys Gly Leu Gly Lys Lys Arg Asp 
95 100 105 

CCA TGT CTT CGG AAA TAC AAG GAC TTC TGC ATC CAT GGA GAA TGC AAA 627 
Pro Cys Leu Arg Lys Tyr Lys Asp Phe Cys lie His Gly Glu Cys Lys 
110 115 120 

TAT GTG AAG GAG CTC CGG GCT CCC TCC TGC ATC TGC CAC CCG GGT TAC 675 
Tyr Val Lys Glu Leu Arg Ala Pro Ser Cys lie Cys His Pro Gly Tyr 
125 130 135 

CAT GGA GAG AGG TGT CAT GGG CTG AGC CTC CCA GTG GAA AAT CGC TTA 723 
His Gly Glu Arg Cys His Gly Leu Ser Leu Pro Val Glu Asn Arg Leu 
140 145 150 

TAT ACC TAT GAC CAC ACA ACC ATC CTG GCC GTG GTG GCT GTG GTG CTG 771 
Tyr Thr Tyr Asp His Thr Tbr lie Leu Ala Val Val Ala Val Val Leu 
155 160 165 170 

TCA TCT GTC TGT CTG CTG GTC ATC GTG GGG CTT CTC ATG TTT AGG TAC 819 
Ser Ser Val Cys Leu Leu Val He Val Gly Leu Leu Met Phe Arg Tyr 
175 180 185 

CAT AGG AGA GGA GGT TAT GAT GTG GAA AAT GAA GAG AAA GTG AAG TTG 867 
His Arg Arg Gly Gly Tyr Asp Val Glu Asn Glu Glu Lys Val Lys Leu 
190 195 200 

GGC ATG ACT AAT TCC CAC TGAGAGAGAC TTGTGCTCAA GGAATCGGCT 915 
Gly Met Thr Asn Ser His 
205 

GGGGACTGCT ACCTCTGAGA AGACACAAGG TGATTTCAGA CTGCAGAGGG GAAAGACTTC 975 
CATCTAGTCA CAAAGACTCC TTCGTCCCCA GTTGCCGTCT AGGATTGGGC CTCCCATAAT 1035 
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TGCTTTGCCA AAATACCAGA GCCTTCAAGT GCCAAACAGA GTATGTCCGA TGGTATCTGG 1095 

GTAAGAAGAA AGCAAAAGCA AGGGACCTTC ATGCCCTTCT GATTCCCCTC CACCAAACCC 1155 

CACTTCCCCT CATAAGTTTG TTTAAACACT TATCTTCTGG ATTAGAATGC CGGTTAAATT 1215 

CCATATGCTC CAGGATCTTT GACTGAAAAA AAAAAAGAAG AAGAAGAAGG AGAGCAAGAA 1275 

GGAAAGATTT GTGAACTGGA AGAAAGCAAC AAAGATTGAG AAGCCATGTA CTCAAGTACC 1335 

ACCAAGGGAT CTGCCATTGG GACCCXCCAG TGCTGGATTT GATGAGTTAA CTGTGAAATA 1395 

CCACAAGCCT GAGAACTGAA TTTTGGGACT TCTACCCAGA TGGAAAAATA ACAACTATTT 1455 

TTGTTGTTGT TGTTTGTAAA TGCCTCTTAA ATTATATATT TATTTTATTC TATGTATGTT 1515 

AATTTATTTA GTTTTTAACA ATCTAACAAT AATATTTCAA GTGCCTAGAC TGTTACTTTG 1575 

GCAATTTCCT GGCCCTCCAC TCCTCATCCC CACAATCTGG CTTAGTGCCA CCCACCTTTG 1635 

CCACAAAGCT AGGATGGTTC TGTGACCCAT CTGTAGTAAT TTATTGTCTG TCTACATTTC 1695 

TGCAGATCTT CCGTGGTCAG AGTGCCACTG CGGGAGCTCT GTATGGTCAG GATGTAGGGG 1755 

TTAACTTGGT CAGAGCCACT CTATGAGTTG GACTTCAGTC TTGCCTAGGC GATTTTGTCT 1815 

ACCATTTGTG TTTTGAAAGC CCAAGGTGCT GATGTCAAAG TGTAACAGAT ATCAGTGTCT 1875 

CCCCGTGTCC TCTCCCTGCC AAGTCTCAGA AGAGGTTGGG CTTCOATGCC TGTAGCTTTC 1935 

CTGGTCCCTC ACCCCCATGG CCCCAGGCCA CAGCGTGGGA ACTCACTTTC CCTTGTGTCA 1995 

AGACATTTCT CTAACTCCTG CCATTCTTCT GGTGCTACTC CATGCAGGGG TCAGTGCAGC 2055 
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AGAGGACAGT CTGGAGAAGG TATTAGCAAA GCAAAAGGCT GAGAAGGAAC AGGGAACATT 2115 " 

GGAGCTGACT GTTCTTGGTA ACTGATTACC TGCCAATTGC TACCGAGAAG GTTGGAGGTG 2175 

GGGAAGGCTT TGTATAATCC CACCCACCTC ACCAAAACGA TGAAGGTATG CTGTCATGGT 2235 

CCTTTCTGGA AGTTTCTGGT GCCATTTCTG AACTGTTACA ACTTGTATTT CCAAACCTGG 2295 

TTCATATTTA TACTTTGCAA TCCAAATAAA GATAACCCTT ATTCCATAAA AAAAAAAAAA 2355 

AAAAA 2 3 60 

(2) INFORMATION FOR SEQ ID NO: 2: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Lys Leu Leu Pro Ser Val Val Leu Lys Leu Phe Leu Ala Ala Val 
15 10 15 

Leu Ser Ala Leu Val Thr Gly Glu Ser Leu Glu Arg Leu Arg Arg Gly 
20 25 30 

Leu Ala Ala Gly Thr Ser Asn Pro Asp Pro Pro Thr Val Ser Thr Asp 
35 40 45 

Gin Leu Leu Pro Leu Gly Gly Gly Arg Asp Arg Lys Val Arg Asp Leu 
50 55 60 
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Gin Glu Ala Asp Leu Asp Leu Leu Arg Val Thr Leu Ser Ser Lye Pro 
65 70 75 80 

Gin Ala Leu Ala Thr Pro Aon Lys Glu Glu His Gly Lys Arg Lys Lys 
85 90 95 

Lys Gly Lys Gly Leu Gly Lys Lys Arg Asp Pro Cys Leu Arg Lys Tyr 
100 105 110 

Lys Asp Phe Cys lie His Gly Glu Cys Lys Tyr Val Lys Glu Leu Arg 
115 120 125 

Ala Pro Ser Cys lie Cys His Pro Gly Tyr His Gly Glu Arg Cys His 
130 135 140 

Gly Leu Ser Leu Pro Val Glu Asn Arg Leu Tyr Thr Tyr Asp His Thr 
"5 150 155 160 

Thr He Leu Ala Val Val Ala Val Val Leu Ser Ser Val Cys Leu Leu 
165 170 175 

Val He Val Gly Leu Leu Met Phe Arg Tyr His Arg Arg Gly Gly Tyr 
180 185 190 

Asp Val Glu Asn Glu Glu Lys Val Lys Leu Gly Met Thr Asn Ser His 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 amino acids 

(B) TYPEs amino acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY t linear 
(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 J 

Val Xaa Leu Ser Ser Lys Pro Gin Ala Leu Ala Xaa Pro Asn Lys Glu 
15 10 15 

Glu His Gly Lys 
20 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Arg Val Xaa Leu Ser Ser Lys Pro Gin Ala Leu Ala Xaa Pro Asn Lys 
15 10 15 

Glu Glu His Gly Lys Arg LyB Lys Lya Gly Lys Gly Leu Gly Lys Lys 
20 25 30 

Arg Asp Pro Xaa Leu Arg Lys Tyr Lys Asp Phe Xaa lie His Gly Glu 
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35 40 45 

Xaa Xaa Tyr 
50 

(2) INFORMATION FOR SEQ ID NO* 5s 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 amino acids 

(B) TYPE i amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGYs linear 

(ii) MOLECULE TYPE* protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 t 

Arg val Xaa Leu Ser Ser Lys Pro Gin Ala Leu Ala Xaa Pro Asn Lys 
15 10 15 

Glu Glu 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 6: 

Ser Ser Lys Pro Gin Ala Leu Ala Xaa Xaa Asn Xaa Glu Glu 
IS 10 

(2) INFORMATION FOR SEQ ID NO: 7s 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Ala Leu Ala Xaa Xaa Asn Lys Xaa Glu Xaa Gly Lys Arg 
15 10 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Arg Val Xaa Leu Ser ser Lys Pro 61n Ala Leu Ala Xaa Pro Asn Lys 
1 5 10 15 

Glu 61u His Oly Lys Arg Lys Lys 
20 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS! 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 

Xaa Xaa Lys Pro Gin Ala Leu Ala Xaa Xaa Asn Xaa Glu 
1 5 io 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

Ala Leu Ala Xaa Pro Asn Lya Glu Glu Xaa Gly Lya Arg 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /note= "Nucleotide number 4 may be T." 
(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION: 22 

(D) OTHER INFORMATION: /note= "Nucleotide number 22 may be G" 
(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 40 

(D) OTHER INFORMATION: /note- "Nucleotide number 40 may be G." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CTTGCCATGC TCCTCCTTGT TAGGCTTGGC CAGGGCCTGA GGCTT 



45 
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(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note= "Nucleotide number 7 may be 6." 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GCCGCCACCA TGG i: 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Cye Asn Ala Glu Phe Gin Asn Phe Cys lie His Gly Glu Cys Lys Tyr 
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15 10 15 

He Glu His Leu Glu Ala Val Thr Cys Lys Cys Gin Gin Glu Tyr Phe 
20 25 30 

Gly Glu Arg Cys 
35 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH t 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY j linear 

(ii) MOLECULE TYPE I protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Cys Pro Leu Ser His Asp Gly Tyr Cys Leu His Asp Gly Val Cys Met 
15 10 15 

Tyr He Glu Ala Leu Asp Lys Tyr Ala Cys Asn Cys Val Val Gly Tyr 
20 25 30 

He Gly Glu Arg Cys 
35 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Cys Pro Asp Ser His Thr Gin Phe Cys Phe His Gly Thr Cys Arg Phe 
1 5 io 15 

Leu Val Gin Glu Asp Lys Pro Ala Cys Val Cys His Ser Gly Tyr Val 
20 25 30 

Gly Ala Arg Cys 
35 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Asp Leu Gin Glu Ala Asp Leu Asp Leu Leu Arg Val Xaa Leu Xaa Ser 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 17: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 457 base pairs 

(B) TYPE j nucleic acid 

(C) STRANDEDNESS: single 
(DJ TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CATATGGAGA AAAAAATCAC TGGATATACC ACCGTTGATA TATCCCAATA TCATCGTAAA 60 

GAACATTTTG AGGCATTTCA GTCAGTTGCT CAATCAACCT ATAACCA6AC CGTTCAGCTG 120 

GATATTACGG CCTTTTTAAA GACCGTAAAG AAAAATAAGC ACAAGTTTTA TCCGGCCTTT 180 

ATTCACATTC TTGCCCGCCT GCTGAATGCT CATCCGGAAT TCATGAGAGT CACTTTATCC 240 

TCCAAGCCAC AAGCACTGGC CACACCAAAC AAGGAGGAGC ACGGGAAAAG AAAGAAGAAA 300 

GGCAAGGGGC TAGGGAAGAA GAGGGACCCA TGTCTTCGGA AATACAAGGA CTTCTGCATC 360 

CATGGAGAAT GCAAATATGT GAAGGAGCTC CGGGCTCCCT CCTGCATCTG CCACCCGGGT 420 

TACCATGGAG AGAGGTGTCA TGGGCTGAGC TAAGCTT 457 

(2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

Met Glu Lys Lys He Thr Gly Tyr Thr Thr Val Asp He Ser Gin Tyr 

5 10 15 

His Arg Lys Glu His Phe Glu Ala Phe Gin Ser Val Ala Gin Ser Thr 

20 25 30 

Tyr Asn Gin Thr Val Gin Leu Asp He Thr Ala Phe Leu Lys Thr Val 
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35 40 45 

Lys Lys Asn Lys His Lys Phe Tyr Pro Ala Phe lie His He Leu Ala 

50 55 60 

Arg Leu Leu Asn Ala His Pro Glu Phe Met Arg Val Thr Leu Ser Ser 
65 70 75 BO 

Lys Pro Gin Ala Leu Ala Thr Pro Asn Lys Glu Glu His Gly Lys Arg 

85 90 95 

Lys Lys Lys Gly Lys Gly Leu Gly Lys Lys Arg Asp Pro Cys Leu Arg 

100 105 no 

Lys Tyr Lys Asp Phe Cys He His Gly Glu Cys Lys Tyr Val Lys Glu 

115 120 125 

Leu Arg Ala Pro Ser Cys He Cys His Pro Gly Tyr His Gly Glu Arg 
130 135 140 



Cys His Gly Leu Ser 
145 
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Claims 

1 1. A composition of matter comprising an isolated 

2 heparin binding EGF-homologous mitogen (HB-EHM) . 

1 2. A composition of matter comprising an isolated 

2 polypeptide which binds heparin, which comprises an EGF- 

3 homologous segment, and which stimulates growth of 

4 fibroblast cells, epithelial cells, and smooth muscle 

5 cells, but not endothelial cells. 

1 3. The composition of matter of claims 1 or 2, 

2 wherein said composition of matter comprises human HB- 

3 EHM. 

1 4. The composition of matter of claims l or 2, 

2 comprising substantially amino acids 108 to 143 of the 

3 sequence listed in SEQ ID NO:l: 

4 CLRKYKDFCIHGECKYVKELRAPSCICHP 

5 GYHGERC. 

1 5. The composition of matter of claims l or 2, 

2 comprising all or part of amino acids l to 208 of SEQ ID 

3 NO:i: 

4 MKLLPSVVLKLFLAAVLSALVTGESLERL 

5 RRGLAAGTSNPDPPTVSTDQLLPLGGGRD 

6 RKVRDLQEADLDLLRVTLSSKPQALATPN 

7 KEEHGKRKKKGKGLGKKRDPCLRKYKDFC 

8 IHGECKYVKELRAPSCI CHPGYHGERCHG 

9 LSLPVENRLYTYDHTTILAVVAVVLSSVC 

10 LLVIVGLLMFRYHRRGGYDVENEEKVKLG 

11 M T N S H. 



WO 92/06705 



PCT/US91/07691 



- 65 - 

1 6. The composition of matter of claims l or 2, 

2 wherein the composition of matter comprises mature HB- 

3 EHM. 

1 7. The composition of matter of claims 1 or 2, 

2 wherein the amino-terminus of the HB-EHM is between 

3 aspartic acid residue 63 and alanine residue 82 as shown 

4 in Fig. 3 (SEQ ID NO: 1) . 

1 8. The composition of matter of claim 7, wherein 

2 the amino-terminus of the HB-EHM is between arginine 

3 residue 73 and alanine residue 82 as shown in Fig. 3 (SEQ 

4 ID NO:l) . 

1 9. The composition of matter of claims l or 2, 

2 wherein the carboxy-terminus of HB-EHM is between serine 

3 residue 147 and proline residue 149 as shown in Fig. 3 

4 (SEQ ID NO: l) . 

1 10. The composition of matter of claims l or 2, 

2 said composition of matter being acid stable. 

1 11. The composition of matter of claims l or 2, 

2 said composition of matter having a pi of between 7.2 and 

3 7.8. 

1 12. The composition of matter of claim 1, wherein 

2 said composition of matter comprises a purified 

3 polypeptide comprising an amino acid sequence 

4 substantially identical to the amino acid sequence shown 

5 in Fig. 4 (SEQ ID NO: 1) . 

1 13. The composition of matter of claim 1, wherein 

2 said composition of matter comprises a purified 

3 polypeptide comprising an amino acid sequence 
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4 substantially identical to the amino acid sequence shown 

5 in Fig. l (SEQ id NO:l). 

1 14. The composition of matter of claims l, 2, 12, 

2 or 13, said composition of matter being glycosylated. 

1 15. The composition of matter of claims 12 or 13, 

2 said composition of matter having an apparent molecular 

3 weight of between 19 kD and 24 kD on a non-reducing 

4 polyacrylamide gel. 



16. The composition of matter of claim 12, said 
composition of matter comprising at least 86 amino acid 



2 



3 residues. 



1 17. The composition of matter of claim 13, said 

2 composition of matter comprising at least 66 amino acid 

3 residues. 



1 18. The composition of matter of claims l, 2, 12, 

2 or 13, said composition of matter being un-glycosylated. 

1 19. The composition of matter of claims l, 2, 12, 

2 or 13, said composition of matter being cationic. 

1 20. The composition of matter of claims l, 2, 12, 

2 or 13, said composition of matter being sufficiently 

3 isolated from other co-purifying substances to be 

4 suitable for human therapeutic use. 

21. Purified nucleic acid which encodes the 
composition of matter of claims l, 2, 12, or 13. 

22. A vector containing the nucleic acid of claim 

21. 
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1 23, The vector of claim 22, said vector being, the 

2 plasmid pMTN-HBEGF , deposited with the ATCC and 

3 designated ATCC Accession No. 40900. 

1 24. The vector of claim 22, said vector being the 

2 plasmid pAX-HBEGF, deposited with the ATCC and designated 

3 ATCC Accession No. 40899. 

1 25. The vector of claim 22, said vector being the 

2 plasmid pNA28, deposited with the ATCC and designated 

3 ATCC Accession No. 



1 26. The vector of claim 22, said vector being the 

2 plasmid pNA51, deposited with the ATCC and designated 

3 ATCC Accession No. . 



1 27. a cell which contains the nucleic acid of 

2 claim 21. 

1 28. The cell of claim 27, said cell being a 

2 eukaryotic cell. 

1 29. The cell of claim 28, said cell being a 

2 mammalian cell. 

1 30. The eukaryotic cell of claim 28, wherein said 

2 eukaryotic cell is capable of secreting a mature, 

3 glycosylated form of said composition of matter into the 

4 medium in which said cell is grown. 

1 31. The cell of claim 27, said cell being a 

2 prokaryotic cell. 
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1 32. The cell of claim 31, said prokaryotic cell 

2 being Escherichia coli. 

1 33. The cell of claim 31, said prokaryotic cell 

2 being E. coli B or E. coli W3110. 

1 34. A method of producing the composition of 

2 matter of claims 1, 2, 12 or 13, comprising 

3 introducing the vector of claim 22 into an 

4 appropriate host cell under conditions which permit 

5 expression of the composition of matter, 

6 expressing the composition of matter, and 

7 isolating the composition of matter. 

1 35. A method for producing the composition of 

2 matter of claims l, 2, 12, or 13, comprising 

3 culturing the eukaryotic cell of claim 28 or the 

4 prokaryotic cell of claim 31 under conditions which 

5 permit expression of the composition of matter, and 

6 isolating the composition of matter. 

1 36. The method of claim 34, wherein said cell is 

2 a prokaryotic cell and said vector encodes a mature form 

3 of said composition of matter. 

1 37. The method of claim 34, wherein said cell is 

2 a eukaryotic cell and said vector encodes a mature form 

3 of said composition of matter. 

1 38. The method of claim 34, wherein said cell is 

2 a eukaryotic cell and said composition of matter is 

3 isolated from the medium in which the cells are grown. 
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39. The method of claim 34, wherein said 
composition of matter comprises an additional amino- 



2 

3 terminal methionine residue 



1 40. A pharmaceutical composition for wound 



healing comprising an effective wound healing amount of 
the composition of matter of claims l, 2, 12, or 13 in a 
pharmaceutically-acceptable carrier. 



l 41. The pharmaceutical composition of claim 40 



wherein said pharmaceutical composition is in the form of 
3 a solution, gel, cream, ointment or dry powder. 

1 42. A method for healing a wound in a patient 

2 comprising applying to the wound a wound healing amount 

3 of the composition of matter of claim 40. 

1 43. A method for the in vitro culturing of a cell 

2 whose growth is stimulated by HB-EHM, said method 

3 comprising contacting said cells with a growth- 
stimulatory amount of the composition of matter of claims 
l, 2, 12, or 13. 



1 44. The method of claim 43, wherein said cells 

2 are fibroblasts, epithelial cells, or smooth muscle 

3 cells. 

1 45. A purified antibody which binds 

2 preferentially to the composition of matter of claims l, 

3 2, 12, or 13. 

1 46. The purified antibody of claim 45, said 

2 antibody being monoclonal antibody. 
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1 47. The antibody of claim 45, wherein said 

2 antibody neutralizes the biological activity in vivo of 

3 HB-EHM. 

1 48. A method for identifying an antagonist to HB- 

2 EHM, said method comprising 

3 providing HB-EHM to cultured cells, whose 

4 proliferation is stimulated by said factor, in the 

5 presence of a candidate antagonist, and 

6 identifying said antagonist as a candidate 

7 antagonist which is capable of blocking HB-EHM- induced 

8 proliferation of said cells. 
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